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PART ! 
PROJECT MAC PROGRESS REPC?T VI 
INTRCDJCTION 


Project MAC ts an M.1.T. Interdepartmental research laboratory 
for computer science and engineering. The research program of 
Project MAC is focused upon the new field of "interactive" 
computing, 1.e., computing In which men and computers’ work 
together in ciose partnership in solving problems or _ making 
decisions. Some of the research is aimed at making [It possible 
for men and computers to work together more effectively. Some of 
it takes advantage of interactive computing to facilitate the 
solution of basic problems in computer science or to develop new 
applications. BuraLmos 1 of its deals with men and computers 
-- hence the acronym "MAC", (There are other expansions of 
"MAC's those we use most often are "multiple-access *omputers" 
and "machine-aided cognition". 


During the last year, Project MAC has accomplished several 
research objectives, and it has gone through part of a major 
transition. Inasmuch as a brief account of the transition will 
set the stage for a discussion of the research results, fet us 
begin with that account. 


The Course of Project MAC 


tn order to bring about a partnership between men and computers, 
one must make It possible -- and economically practicable -- for 
men and computers to work together directly and effectively. The 
first major undertaking of Project MAC, beginning in 1963, and 
building on early work of the M.1!.T. Computation Center, was to 
create a computer system with which many people could work 


simultaneously and conveniently. The result was the first 
general-purpose multt-access computer system, CTSS, the 
Compatible Time-Sharing System. (Actually, two CTSS's’ were 


built; one was operated by Project MAC, the other by the 
Computation Center.) 


By 1965, CTSS was the focus of a estgnificant part of the 
intellectual effort of about 200 research people at M.i.T. and 
quite a few at other universities. 't was obvious” that 
interactive computing opened new horizons. The present computer 
"time-sharing" industry, with its 150 to 200 service companies, 
is one of them. CTSS made it evident that a multi-access 
computer could provide the communication as well as_ the 
information storage and processing facilities required for _ the 
emergence of a new kind of intellectual community. Partly by 
having them and partly by iacking them, CTSS made it possible to 
see what characterlstics and features a "community™ multi-access 
system should possess. 
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The effect of CTSS upon Project MAC was. threefold: its quick 
success, together with the flooding-in upon the designers of 
ideas about how to bulld a far better system, led to. the 
determination to create a truly comprehensive communtty computer 
system, and, with the General Electric Company and tke Bell 
Telephone Laboratories, Project MAC Initiated a major effort 
toward that goal. Not all the computer scientists and engineers 
who contributed to the development of CTSS wanted to participate 
in the development of the more advanced system: many turned to 
research in other aspects of computer science and engineering. 
At the same time, CTSS tended to bring its community of 
substantive users -- people interested in using Interactive 
computing to facilitate research in their various flelds -- Into 
Project MAC. Thus Project MAC came to have three main parts: 
(1) the Computer System Research Group, developing the new 
multt-access computer system, (2) several smaller research 
programs In computer science and engineering, and (3) a_ large 
number of users of CTSS. 


During 1967 and 1968, as the use of CTSS turned from a research 
experfence Into an operational routine, and as the development of 
the new multi-access system required an increasing fraction of 
the available funds, the third part dwindled. This past year, 
the CTSS operated by Project MAC was transferred to the 
Information Processing Center (formerly Computation Center), and 
Project MAC became a two-part laboratory. That was the first 
part of the transition. 


The second part of the transition began on the first of January 
1963. The new multi-access system, called the Multiplexed 
Information and Computing Service (Multics), had proven to be 
much more complex and difficult than anticipated, and its 
development had proceeded at just half the scheduled speed -- 
until the beginning of the new year. Then everything began to #0 
well, the schedule quit slipping and now (July 1969) the heart of 
Multics beats, the Multics Operating System operates. 


The Multics that now "runs" is a bare-bones system, but ai 
advanced one. The Computer System Research Group ts confident 
that It will be opened for general use on 1 October 1969. That 
will begin the third part of the transition. The main system 
research effort will turn from the operating system to the 
comprehensive lfbrary procedures and data required to convert 
such a "system" into a "service", to make it effectively and 
conventently useful. to others than computer buffs. This 
service-building effort will lead, we believe, to a_ realization 
of our now long-dreamed dream of an "on-line" intellectual 
community. But It will not bring the whole community into 
Project MAC am only the service builders. Operational 
responsibility for Multics will be transferred to the Information 
Processing Center. 
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Characteristics and Features of Multics 


Inasmuch as the demonstration of the Multics Operating System was 
one of the main accomplishments of the past year, it is an 
appropriate time to explain the basis for the claim that Multics 
provides a new and unique facility for a true on-line community 
of users, a foundation for unprecedented teamwork in many kinds 
of undertaking that are based upon information. There are two 


main areas in which Multics represents a major advance. in the 
first of these areas, a few other systems, developed since the 
desien of Multics was published, are comparable to Multics. In 


the second, to the best of our knowledge, Multics is unique. 


First, Multics introduces a great simplification Into the 
individual user's concept of the computer system. Heretofore, he 
had not only to remember the names of his programs and sets of 
data but also to keep In mind their sizes and, if large, squeeze 
them (all together or in subsets) into the computer's memory, 
which was pictured as a single series of little bins, each 
capable of holding one "word" of procedure or data. Using 
Multics, he thinks only fin terms of the names; the operating 
system brings into memory those procedures and data required at 
each moment and does not waste expensive memory space (which can 
be put to good purpose by other users) on a user's’ inactive 
information. Programming within the context of Multics, the user 
has available to him the equivalent of millions of plteces of 
writing paper, each just as long as he needs it to be -- but he 
need not remember how long, or where he put the pieces; he just 
names them and remembers the names. (If he forgets the names, 
Multics will of course provide a listing.) Nor does the user 
have to think about the configuration of the of the “hardware” 
computer. It may change basically, e.g., in number of processing 
untts or in number of memory blocks, through failure of subsets 
or through augmentation, but the only way that affects the user 
is to slow down or speed up the service. His working image of 
the configuration of the computer system remains the same: a 
"space" occupied by the names of procedures and data. 


Second, Multics greatly facilitates cooperation among its users. 
It makes it easy for them to work in pairs or groups, using 
shared as well as individual programs and data sets and 
communicating with one another through their consoles as well = as 
face-to-face or by telephone. To use another person's procedure 
or data, one has only to get his permission. One does not have 
to borrow a copy either overtly or within the system; two or 
more people can use the same "copy" of a procedure or of a set of 
data at the same time. Multics keeps one of them (and his 
programs) from changing it while the other (another) is in the 
process of "reading" it. Without permission of the owner, access 
to files is barred. The owner can extend permission to any 
individual or established group to use one of his files in = any 
one of several ways: "read and change", "read only", “execute 
only", and "use only through a privileged program". The Multics 
file system provides the basis for free, voluntary cooperation, 
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and also the basis for entrepreneurial cooperation for a fee: 
Multiecs will keep records of the use of certaln programs and data 
and send bills on behalf of the owner. 


Professor F, J. Cerbato, leader of the Computer System Research 
Group, has a list of 30 characteristics and features, of which 
the foregoing are a few, that define Multics. It will not be 
long, now, until we see whether they will make the expected 
difference, whether they will greatly facllitate the individual's 
use of his own information and, at the same time, turn the 
computer Into a communication network. 


Knowledge and Heuristics in the Computer 


Computers are so fast and accurate in their executicn 9 of 
Procedures that they proved themselves very useful long before 
any computer was programmed to do anything that seemed at all 
intelligent. Indeed, to combine the computer's speed and 
accuracy in applying defined procedures to specified data with 
man's abllity to formulate and evaluate Is the essential aim of 
man-computer interaction, But communication between men and 
ordinary computer systems Is so poor as to frustrate any attempt 
at teamwork. There can be little man-computer partnership if the 
computer knows practically nothing and has to be told at every 
stage precisely what to do and In detail? how to do fit. 


A large and important part of the work of Project MAC Is devoted, 
therefore, to learning how to "educate" the computer to be a 
better partner. It does no good simply to fill the computer's 
memories and stores with facts; the uneducated computer’ can 
remember but not use them. It does little good to fill the 
memorfes and stores with procedures; the uneducated computer’ has 
to be told exactly when and to what data to apply each one. The 
problem Is, essentially, to understand the process of being 
intelligent and to program at least some of that process into the 
computer. It is clear that the process involves knowledge, which 
can be defined roughly as facts structured Into a model that can 
be "run" or be interpreted by the processor, and heuristics, 
which can be defined roughtly as guidelines to solution or 
discovery. 


In Project MAC, during the past year, several accomplishments 
were made in the education of computers. In most Instances, the 
computer system was the Incompatible Tlme-Sharing System (ITS), a 
system somewhat smaller than CTSS and Multics and speclalized in 
quite different ways. You have to be a computer buff to use ITS 
-- but, If you are, you like it so much that you deo nothing to 
make it easy for non-buffs to understand. As a result of the 
educational accomplishments, iTS can -- among other things <-- 
now: 


1) Solve calculus "word problems" of the type, "A 
ladder 20.0 feet long leans against a house. Find the 
rate at which the top of the ladder its moving If its 


ail 
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foot is 12.0 feet from the house and moving away from 
the house at the rate of 2.0 feet per second." (E. 
Charniak) (But see Part I! of this Report for the 
section on Artificial intelligence and tntelligent 
Automata for a discussion of the difficulty of this 
problem, ) 


2) Look (through its television-camera "Eye") at a 
haphazard pile of toy btocks on the floor. analyze the 
scene into Individual blocks (Prof. A. Guzman; 8B. K. P, 
dorn), pick them up one-by-one (with its mechanical arm 
and hand), and stack them up to make a tower. 


3) Play Class C chess instead of Class D chess, as 
last year. (R. Greenblatt) 5 


4) Solve symbolic (non-numeric) integration problems, 
now even including problems’ involving logarithms and 
exponentials, such as the Gaussian ("error") function. 
(Prof. J. Moses) 


The systems of programs that do the foregoing things, as well as 
others now operating, have some knowledge and some heuristic 
capability, but not much. They are. successful only in quite 
special and restricted contexts. A large group, led by 
Professors M. M. Minsky and S. A. Papert, is working intensively 
to advance the understanding of such problems. 


Man-Computer Interaction 


Next to the stupidity of ordinary computer Systems, the main 


barrier to effective and convenient communication between men and. 


computers has heen the computer "console". in practical fact, 
the console used in 98 per cent of all on-line man-comouter 
communtcation these last few years has heen merely an electric 
typewriter with additional electromechanical parts to send codes 
to and receive codes from a computer. Such typewriters aye 
narrow bottlenecks. Obvicusly they had to give way to, or at 
least be supplemented by, graphic displays and non-keyboard 
computer-input devices such as pencils and microphones. 


The past year saw a marked advance in the commercial supply of 
ultra-typewriter equipment for man-computer interaction. The 
Advanced Remote Display Station (ARDS), developed by the 
flectronic Systems Laboratory in conjunction with Project MAC, 
led an important part of the advance. its storage cathode-ray 
Screen provides a non-flickering image of fairly good resolution 
and the capacity to display a= standard page of typescript 
(unfortunately at somewhat reduced size). Most importantly, it 
can display graphs, diagrams and maps, and it presents 
information very much faster than a typewriter. 


Perceiving the importance of improved man-computer interaction 
techniques to the future of interactive information processing, 
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Project MAC recently augmented its research programs in computer 
graphics and dynamic modeling. As tultics moves from its 
developmental to its operational phase, increasing effort will be 
devoted to understanding and development of those areas. 


Computer Language 


The intrinsic "language" in terms of which a computer carries out 
its internal processes is hardly a language in the everyday 


sense; it is just a repertoire of primitive operations upon 
information In its memory and upon its own configuration. Great 
difficulties arose when men tried to communicate with the 
computer in its own "machine language". One of the main 


practical advances of the early years of computing was’ the 
development of "higher leve!" languages for use in_ preparing 
programs: FORTRAN, ALGOL, and so on. Then, when such languages 
were in widespread use, people began to subject them to 
linguistic analysis and to try to understand them theoretically. 
That effort led to a clearer understanding of natural languages 
as well as computer programming languages. It is a continuing 
study, but a part of it recently reached a culmination. 


Much of the effort of the Computer Linguistics Group of Project 
MAC, led by Prof. J. M. Wozencraft, had focused on working out a 
systematic analysis and exposition of the basic concepts of 
computer programming languages on the basis of a method called 
the lambda calculus. That effort was completed at the end of 
1968. The exposition is presented in the class notes of Computer 
Linguistics (6.231) and in several publications. The end of 1968 
was thus the end of an era. Professor Wozencraft took leave of 
absence to serve as Associate Head of a division of the Lincoln 
Laboratory. The group, now under the leadership of Prof. R. MM. 
Graham, is attacking new computer language problems: languages 
for programming operating systems (Prof. Graham), languages’ that 
can be readily extended by the individual programmer or 
programming team to meet special requirements (Prof. A. Evans, 
Jr.), anda new way of formalizing computer languages’ and 
formulating and providing theorems about them (Prof. J. J. 
Donovan). 


At the same time, it Is becoming Increasingly clear that there is 
more to computer language than just language for programming. As 
the library of programs grows, the ratio of program preparation 
to program use decreases, and the languages through which people 
interact with programs ("Interaction languages") become more 
Important. In a part of this new area, the area of data 
description languages, Project MAC and the Special Interest 
Committee on Fille Description and Translation of the Association 
for Computing Machinery together recently initiated a weekly 
seminar. 
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Other Research Areas 


Project MAC is active in other research areas, also, and made 
significant advances in many of them during the past year. 
Professors J. Weizenbaum and R, R. Fenichel completed the first 
version of their computer Program, TEACH, that teaches computer 
programming, and "proved" it on an introductory class. This work 
was joint with the Education Research Center. Professors Donovan 
and M. M, Jones and their students developed an interactive 
simulation programming language, SIMPLE. W. T. Beyer found a way 
to reduce very greatly the amount of computation required in 
certain kinds of weometrical information processing; he found, 
for example, that through use of a parallel computer paradiem the 
humber of computational steps required to determine whether or 
noc two figures, represented in a grid, are connected can be made 
Proportional to the number of intervals in one dimension of the 
grid rather than to that number squared, Yarriet J. Fell worked 
out a way of dealing mathematically with topological problems 
that arise in discrete (i.e., eridlike) approximations to 
continuous spaces; in mathematical terms, she showed that the 
Nausdorff topology is the intersection of all possible” grid 
topologies. And so on. It is not possihle to do justice to them 
in a brief review. (See reports from the various groups.) 


Let us conclude the discussion of Project MAC's research, 
therefore, with a brief word about computer networks. This is, 
in our view, a new field of very great potential. Project MAC 
has initiated a new research group to work on computer-network 
problems and techniques. We are looking forward to participation 
in the experimental ARPA network, which will link multi-access 
computers in several universities. A. K, Bhushan has already 
published two papers in the new field and is planning thesis 
research in it. 


Administration of Project MAC 


Project MAC has a simple” structure: about twelve research 
groups, two computer installations, a document room, a 
pullications office, and a headquarters, It has a director, 


(now) two assistant directors, and an assistant-to-the-director 
who is the cement that holds the project together. Since Prof. 
R. M, Fano retired from the directorship at the end of the Summer 
of 1968, Prof. J. C. R. Licklider has been Director of Project 
MAC. Professor Jones and Mr. D. E. Burmaster (vide Infra) are 
the Assistant Directors. Miss Dorothea C. Scanlon is. the 
Assistant to the Director. In addition to conducting and 
publishing research, Project MAC holds seminars and Particinates 
in educational undertakings. This Past year, there were, in 
addition to the standard, general Project MAC colloquia, seminars 
in theory of computation and theory of automata, and the one 
(mentioned earlier and only recently started) on data description 
languages. Project MAC initiated two "project laboratories" 
(laboratories conducted on a research-project basis that yield 
regular academic credits), cone in software engineering and one 
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(joint with the Education Research Center and the Lincoln 
Laboratory) tn computer graphics. 


At the beginning of the Summer of 1969, Burmaster was appolnted 
Assistant Director of Project MAC for Student Activities -- and 
also Business Manager. As an M.I.T. student, Burmaster was an 
effective activist for student involvement In research and 
student access to computers. With a few colleagues, he conducted 
the campaign that led to the formation and funding of the M.I.T. 
Student Information Processing 30ard and, Jjnter alia, to the 
initiation of the Computer Graphics Projects Laboratory. Under 
his direction, Project MAC is now dedicating itself to a major 
Increase in student (including undergraduate student) 
participation. The computer fleld ts a "natural" for young 
people. They net only learn to understand computer systems 
faster and better than thelr elders; they can do sianificant 
original research soon after entering the fleld, 


For the five years of its existence preceding this last year, 
although its terms of reference encouraged diversified support, 
Project MAC was funded exclusively by the Information Processing 
Techniques Branch of the Advanced Research Projects Agency 
(ARPA). This last year, while the information Processing 
Techniques Branch of ARPA continued its major support at the 
established level, small or medium-sized additional research 
programs were undertaken under the support of the National 
Aeronautics and Space Administration (Extensible Languages, Prof. 
Evans; and a Laboratory for Research in Perception, Prof. 
Minsky), the National Library of Medicine (Features and Costs of 
Multi-Access Computer Services, Professors Licklider and 
Corbato), the Office of Naval Research (interactive Problem 
Solving and Decision Making, Prof. Jones}, and the Behavioral 
Sciences Branch of ARPA (Dynamic Modeling, Prof. Licklider). 
Where funding from these new sources (or, in the case of joint 
research efforts with other M.i.T. groups, other sources) is 
involved, note will be made to that effect in the reports that 
follow. 


A final item: Project MAC contributed significantly to the 
planning and establishment of a new research project, the 
Cambridge Project, for computer analysis and modeling in_ the 
behavioral sciences. The Cambridge Project has been funded by 
the Behavioral Scltences Branch of ARPA through the Defense Supply 
Service of the Army. The Cambridge Project will Improve and 
specialize interactive computer methods for use in 
behavioral-science research. 
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1. COMPUTATION STRUCTURES 
Objectives -- Jack 8. Dennis 


The Computation Structures Group seeks to develop and to study 
advanced concepts in the desizn and organization of 
Zeneral-purpose computer systems. We wish to understand the 
influence that technology, economics, programming languages, and 
the nature of computer use should have on the architecture of 
computer systems. Projects within the group often involve. the 
invention and study of a mathematical model from which one can 
deduce conclusions having direct implications for the structure 
of prograns and dizital systems. 


Current research by the Computation Structures Group falls into 
four main areas: 


1) Representation of parallelism in programs and 
study of correspondinz semantic theories of 
computation; 


2) Structured information -- development of a 
forinal model; basic primitive operaticns and their 
oropertics; the imnlications of security, privacy, 
and controlled sharing of procedures and data; 


3) Architecture of computer processing and memory 
hardware; 


4) Design and specification of digital systems, 
particularly in the form of interconnected 
asynchronous modules. 


The eventual goal of building a general-purpose computer of 
radical, highly parallel architecture inspires and Zuides much of 
the group's research. The desitn of this machine would emphasize 
the independence of Prograns fron the context of their 
application so that they might arbitrarity be combined without 
concern for their internal desizn. We have argued (Dennis, App. 
C) that this ability can be realized effectively only im machines 
that exploit the potential parallelism in algorithms. 


Most of the individual research topics outlined below are 
Starting formulations cf problems for doctoral research. 
Although the topics have frown from the background of aims and 
ideas within the group, each study reflects the interest and 
judgment of its author. 


Controlled Information Sharing in a Computer "tility -- Dean 4, 
Vanderbilt 


The development of multi-access computer systems through the 
principle of time-sharing has greatly enhanced the services that 
computer systems can offer their users. Large numbers of users 
ore now able to have simultaneous on-line, interactive use of 
the system facilities, thus achieving orders-of-magnitude shorter 
response times than is possible with hatch systems. These 
systems also provide facilities that enable the user to store his 
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prozrams and data within the system, rather than requiring him to 
subnit then to the system for each use. 


This ability to store information within the system has created 
the possibility for each user to make his work easily available 
to others. This is accomplished hy allowing users to. share 
information stored within the system. A user could specify with 
whom and in what manner information is to be shared, and other 
users could then make use of that information, within the 
specified restrictions, as easily as they could use their own. 


These developments have led to a broadened concept of a computer 
utility: a multi-access system with facilities for general use 
by large numbers of people. tn addition to making increased 
amounts of "“conputinz power" available, the computer utility 
would be a vehicle through which users could make their work 
availab!e to others in the user community. Therefore, proviston 
of convenient means by which users may share, In a_ controlled 
way, information stored within the system Is e primary objective 
li the development of the computer utility. 


The present research is an attempt te understand more thoroughly 
the requirements that information-sharinz places on the design of 
a computer utility. The approach is to specify a model of those 
parts of a conputer utillty that retain and access programs and 
data structures on behalf of users, and that control access to 
and sharing of these objects. Emphasis in the model is placed on 
the necessary logical organization of the stored information and 
the means provided to users’ for accessing and allowing other 
users to access the informatton. The following discusston 
illustrates the methods being employed to jevelop the model. 


We accept the foilowing principle as_ hasic: Access to 
information Is granted to a recipient user only as necessary to 
enable the accomplishing of sharing that has heen explicitly 
allowed by a donor’ user. One obvious consequence of this 
minimum-access princiovle ts that information that has not heen 
specifically shared will be accessible only to the user. who 
created it. There are other consequences which we will discuss. 


Let us examine the types of Information that it shouid he 
possible to share. First, {ft should be possible to share the use 
of programs. This Is Implict!t in the concept of building on the 
work of others. There are two consequen:es of this idea. 
First, because of our minimum-access principie, it must be 
possible to permit the execution of a program without forfeiting 
knowledge of Its structure. !t seems clear that, unless this is 
possible, the Idea of leasing software is not feasible. If it 
Is necessary for the program user to be informed of the program's 
structure, this will be In effect purchasing the program rather 
than paying for Its use. 


The second consequence of sharing the use of a program Is that it 
must be possible for the borrower to execute the program. 
Generally, the execution of a program will require the use of 
various data and the execution of other prozrams (subprograms). 
Some of this Information will be supplied to the program by the 
caller. But the remainder will have heen snecified by the 
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creator of the obrogram, and the system must ensure that it is 
nade available during the program's execution. A common example 
of this second type of information is the subroutine library. tn 
this case, the program creator specifies the subroutine by 
utilizing a special name for it. The system then recoenizes this 
name, and makes the specified subroutine available whenever the 
Program is executed. Usually, this information can be supplied 
by the system, or specially constructed by the prograin creator, 
or borrowed by him from some other user. 


In each case, this additional information must be made available 
to the borrower of a program when the program is executed, 
dowever, in accordance with our guiding princtple, this 
information should be associated with the main program so that 
the use of this ability is directly associated with execution of 
the main program, 


The need to form these associations leads us to the concept of a 
procedure, For our purpose, a procedure is. an entity of 
structured information consisting of first, a program, and 
second, a set of abilities to access other programs and data, 
These programs and data constitute the information, in edditton 
to information supplied by the caller of the Program, that may be 
required for the execution of the Program, 


4ierarchical Associative Menories -- Jeffrey L. Gertz 


Recent work by Prof, Jack 8. Dennis (Nennis, App. C) has 
indicated that two current trends in computing -- first, the 
increasing importance of parallelism in computer operations to 
improve hardware utilization, and second, the concept of 
Prozranming generality which allows promrams to be written that 
are independent of the hardware environment -- require a radical 
change in thinking about computer System architecture. In 
particular, the requirement that Programs be allowed to transmit 
arbitrarily complex information structures aS parameters’ to 
procedures with unknown storage requirements impiftes that one 
must use location-independent addressing. One possible manner of 
accomplishing this objective is to employ an associative memory. 


In this research project, we are concerned with the study, 
analysis and design of a multi-level associative memory for a 
highly parallel computer system. With such a memory, when a 
Processor needs a memory word ‘data or instruction), it presents 
a key to the memory system (rather than an address, as is 
Presently the case) which uniquely specifies the word, The 
memory system then locates the word and brings ft to the sphere 
of the processor, 


3efore one can propose detailed operation of the memory system, 
he must know two fundamental facts. First is the nature and 
representation of the information to he stored in’ the memory; 
second, the physical structure and Organization of the units that 
comprise the memory system, 


A program will consist of two. parts: one or more Pure 
Procedures, and, for each of these, an associated data area. 
Each of these items will be organized and stored as an 
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structure. In particular, each pure procedure will 


information 
be represented as a precedence graph in the manner of Martin and 
Estrin (see Ref. 1 at the end of this section). 


Each information structure will he represented in the memory by a 
collection of pointers. Traditionally, in addressable memories, 
downvard pointers are used for the type of operations we shall 
consider. Yowever, in this research, we shall present) an 
implementation using upward pointers (from son to father). We 
shall show that, for an associative memory, this method is 
superior to those considered before. The physical memory will 
consist of several levels. Because of cost and size limitations, 
probably only the hishest level will be truly associative -- the 
others will be traditional types. Therefore, we must consider 
the problem of making core and drum memories look associative. 
Possible methods are hash-coding (core memory) and associative 
queues (drum memory). we shall also examine the usefulness and 
effectiveness of using modular memories in this work. 


3ecause the memory system will consist of several physical 
levels, the question of transfer of information between them 
arises. Others, especially Dr. Peter J. Denning (Denning, TR-50, 
App. 8), studiet this problem. We consider the possibility of 
using dynamic "pages" instead of the usual fixed pages, since 
this concept appears to fit in well with the use of information 
structures to represent programs. In this type of 
implementation, each pare to he hrought into a higher-level 
memory will be formed when it is demanded and it will not have a 
fixed size. 


Finally, after the memory system is specified, we shall study it 
analytically. We shall examine the queueing characteristics of 
modular memory systems and inter-level memory queues. We shal] 
simulate several parts of the proposed system to assess their 
performance. We shall compare the system with more conventional 
ones according to several performance criteria. 


Aodular Associative Memories -- Jeffrey L. Gertz 


We have begun a study to analyze the behavior of a moduiar 
associative memory in a parallel processing system. 
Characteristics of the assumed system are: N statistically 
identical processes active at -11 times; sufficient processing 
capacity that no process need wait for instruction execution; and 
a common associative memory consisting of M (< N) tdentical 
modules. 


Each process Is assianed one of the modules as its orimary store, 
in the following circumstances: 


1) When the process needs a word from memory, it 
looks first in tts primary module; 


2) When a word belonging to the process is brought 
into memory, tt Is placed in its primary module 
(at any time, every word in memory is assumed to 
be uniquely owned). 
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Statement 2, of course, is the reason for statement 1. 
With these conditions, we assume the:, when a_ process 
makes a memory request, it finds the word in its 
primary store with probability p, in another module 
with probability q (due to inter-process sharing or use 
of common information), or in a lower-levei memory with 
probability 1° p= q. The values of p and q will, of 
course, be a function of the number of modules (larger 
4 means smaller 9p), but remain fixed if the over-all 
size of the memory is also fixed. Finally, we assume 
there is a number q_ that gives the fraction of memory 
requests by a process that represents sharing (and 
hence q ¢ Io): 


we assume sequence of actions by any process is: 
1) Generate a memory request. 
2) Search its primary store for the word, 


3) If unsuccessful, continue searching 
module-by-module for the word. 


4) If still unsuccessful, locate the word in a 
lower-level memory. 


5) Enter a processing period, the length of which 
is a geomgtrically distributed random variable 
with cane 


6) Return to step 1. 
Steps 2, 3, and 4 nay involve waiting in queues. 


“We have developed an analytical solution of the above system -- 
giving the number of requests handled per memory cycle (R), 
average number of processes heing processed at any time, and 
other parameters -- under the assumption. that circulating 
requests are handled before new requests at any module. We have 
studied both methods of priority assignment with aie simuletion 
model. 


We have found two preliminary results. First, several test runs 
with the simulation model appear to indicate that both priority 
assignments give the same rate of request-handling capacity. The 
circulating first algorithm results in fewer queues because there 
are no inter-module queues; it appears to be the superior method 
of operation. Second, an optimal number of memory modules, whose 
value is a function of various system parameters, is generally 
well below N. As one might expect, both this optimal value = and 


-the system through-put increase as inter-process_ sharing 


decreases. The figure gives a sample of this behavior for _ the 
values indicated, We hope to derive formulas that will allow 
determination of the optimum number of memory modules to use for 
determining the corresponding system through-put In terms of a 
few Important system parameters. 
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Jizital Computer Urganization -- Lawrence Seligman 


Ne are studying organization of very-high-performance digital 
computers in the form of time-independent interconnections of 
asynchronous modules. Two goals are paramount: to demonstrate 
the suitability of asynchronous structure, and to develop 
corresponding analytic tools capable of characterizing system 
erformance. 


The existing very-hish-speed processors are essentially 
synchronous designs. While some :f these systems appear to 
employ asynchronous communication among the major components -- 
memory modules, processors and 1/0 channels -=- each of these 
components is implemented as a synchronous” system, The 
suitability of asynchronous losmic -- for example, Prof. Fred L. 
Luconi's computational structures (Luconi, TR-49, App. B) -- 
seemed an open question at the heginnine of this study. We have 
developed an asynchronous model for a very-high-spee1 processor 
which includes provision for representing multiple independent 
functional units and dynamic allocation of processor hardware for 
the storaze of intermediate results. We are extending the model 
to include micro-prozranming facilities and hierarchical memory 
orzanizations. 


The study of such models is fruitful because they are amenable to 
analytic techniques. Dr. Denning's recent work (Denning, TR-50, 
App. 3) on resource allocation has been adapted to study the 
performance of a hierarchical inain memory system, eliminating the 
need for evaluation by extensive simulation. 


Modular Jesign of Asynchronous Arbiters -- Suhas S$. Patil and 
Jack 3. Dennis 


In computer systems, it is advantageous to arrange similar 
resources into pools from which units may be allocated as needed 
to serve on current request. For example, the processors of a 
mutti-processing system are often pooled for use by many 


processes. An arbiter is a device that resolves potential 
conflicts in the allocation of resource units to meet nearly 
slinultaneous requests. Synchronous arbiters are used in large 


processors such as the CNC 6690 (Control Data Corporation), but 
they are complex, intertwined with other mechanisms of the 
machine, and are therefore very difficult to design and debug, 
Asynchronous arbiters are frequently used to control access to 
multi-port memory modules. These are sinezle-server arbiters, are 
not usually time-independent, and are not modular in their 
internal construction. 


We are investigating the desisn of general arbiters that are 
time-independent in their operation and are of modular 
construction. We have completed a design for an n-server, m-user 
arbiter as the interconnection of a few types of basic 
asynchronous modules. Two of these module types are specific to 
the functional requirements of arbiters and may be viewed as 
additional basic modules for the design of generalized arbiters 
for situations in which an individual request may be for a 
combination of resource units, possibly of different species, 
rather than just a single unit of one type. The performance of 


the arbiter will also he investiszated. 


The work of A. We. Holt and his colleagues (Ref. 2 below) has been 
very influential on this research: the use of "Petri nets" to 
represent asynchronous systems has been particularly convenient 
for the study of arbiters. 
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2. COMPUTER SYSTEM RESEARCH 
CTSS and Multics System Development -- Fernando J. Corhato 
Introduction 


The research and early development plans of the Multics project 
were carried out jointly by members of the Bell Telephone 
Laboratories, the General Electric Company, and Project MAC. As 
Multics work has shifted into the final stages of implementation, 
BTL participation has diminished and, during the last six months 
of this period, the project has been joint between members of GE 
and Project MAC. 


July 1968 through June 1969 was the key year for the Multics 
(Multiplexed Information and Computing Service) System. In June 
1968, Multics supported fewer than eight users on a test-session 
basis. By July 1969, the system was supporting at least 18 
users, doing heavy system programming work, on an 
around-the-clock basis. The group has established 1 October 1969 
as the date for opening the system to use by the general .1.7. 
user community. Again this year, the primary efforts of the 
group were aimed at increased functional capabilities for the 
system and for improved performance, in terms of both the number 
of users the system can accommodate and the responsive service 
each user gets. 


The next main section constitutes a brief technical review of the 
steps involved this last year in bringing Multics to operational 
status. 


Functional Capabilities 


DIM Benchmark. The group achieved what has been called the 
Demonstrable Jnitial Multics benchmark in October 1968. The DIM 
systein was defined by its ability to support at least eight users 
for several hours at a time. Under DIM, group members could use 
the system for productive work (i.e., instead of merely allowing 
system testing, DIi4 allowed creation of new modules and debugging 
on the system itself). However, users still did compilation = and 
assembly under the GECOS (batch) system. Additions made to the 
system in reaching the DIM benchmark included: 


1) Error recovery from the "hardcore supervisor", 
so that one need not invariably "crash" the system 
when hardcore routines encounter difficulty; 


2) An interim version of "multilevel storage", 
which allowed files to be on disc as well as in 
drum memory; 


3) The system-shutdown facility, which allowed the 
running Multics system to be stopped = and 
re-started in an orderly way; 


4) An interim back-up facility, which dumps~ disc 
and drum (and permits subsequent jin toto 
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i : . by January 1968, the group had achieved the 


Limited initial Multics benchmark. Lit! went beyond the DIM 
system in two basic ways: in performance, LI!t supported 12 
users; in functiona’ capabilities, LIM permitted Multics 


development work to be (in principle) independent of CTSS 
(Compatible Lime-Sharing System). Source files were created and 
edited under Multics, compiled under GECOS, and returned to 
Multics for testing and debugging. Althoush CTSS continued to be 
used for such work on a production basis, new Multics System 
Tapes (containing self-initializing new versions of the system) 
were generated from within Multics. Features In the LIM system 
Inctude: 


1) A basic User Control module, which performs 
login and logout functions and allows one to 
"quit executinz a process; 


2) Performance improvements in the Command System, 
including a faster Shell (the command language 
interpreter) and Listener (which manages 
communication between a user's console and _ the 
rest of the Cormimnand System); 


3) Performance improvements in the Traffic 
Controller, lacluding a more efficient 
Interprocess Connunication facility. 


At. the LIM benchmark, system testing and maintenance had been 
orzanized so that weekly new systems incorporated new development 
work and the fixes for recently analyzed bugs. 


New File System. With the accomplishment of LIM, the system 
reached a point where formal definition of further benchmarks was 
inappropriate; “backages" of chanzes comprising discrete 
benchmark systems were no longer needed; independent functional 
areas came under Intense= scrutiny. As it has always heen in 
Multics, the most import-nt functional area was the File System, 
The eroup began fa January to work for a complete re-evaluation 
and re-working of the extremely tightened coding and program f low 
modules which comprise the "New File System", Major poals were 
to speed up missing segment fault-handling, linkage 
fault-handling, new process creation, and general directory 
manipulation. The New Flle System effort adhered closely to 
schedule. The desisn was stabilized in January, firm schedules 
were set by early February, and by March a test system based on 
the New File System had reached command level (i.e., successfully 
completed system Initialization). By May, the New File System 
had been Integrated with the standard system, Its 
pre-installation check-out work was sufficiently thorough to 
eliminate the need to "back up" to a previous standard system 
because of post-installation difficulties. 


“Gi . The next major functional area is 
input/output. In early February, introduction of the Mini-GIM, a 
streamlined Interface module, into the General 1/0 Controller 
(GI0C), considerably speeded up the typical 1/0 processing path. 
The mini-GIt, less elaborate than the full GIM (GIOC interface 
Module), permits high-efficiency performance of hasic 1/0 
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operations. [fore elaborate operations can still be performed 
through the full GIM. With the mini-S1IM, the "typewriter DIM! 
(Device Interface Module) -- the module that is the major link 
between the users' consoles and. the system -- was moved into 
"ring O” (the supervisor's protection ring). This further 
speeded up 1/0 because ring-0 modules tend to be paged out less 
frequently and because "wall-crossings" (i.e., changes of 
Protection ring) are minimized. An even greater advantage is 
that the change requires Only one input buffer for all 
typewriters, instead of one input buffer per typewriter. 


2 r Self-sufficiency is an important design 
capability of Multics. To achieve it, and thereby to become 
independent of CTSS, two requirements had to be met: First, 
Multics System Tapes (mST's) had to be senerable within § Wultics 
itself. This ftinplies working Multics facilities for magnetic 
tape 1/9, for segment-binding, and for editing of MST's. Second, 
the source files for the entire system had to be conveyed from 
CTSS to Multics, an involved administrative and operational task. 
3y the end of June, both were accomplished, and the only Multics 
work stili on CTSS was in Tanguaze development. It was left on 
CTSS simply to avoid overloading the Hultics machine. 


‘ The group developed and installed several 
important tools in the area of system reliability during the 
year. The crude back-up procedure of dumping the entire contents 
of drum and disc nemory for in teto reloading was supplanted by 
an “incremental dumping" procedure. Within a few minutes after 
new files are created, they are written out an magnetic tape. If 
it is necessary to retrieve particular files, they are reloaded 
selectively. incremental dumping greatly reduces work losses 
from system crashes. 


The second major aid to reliability is a program called the 
"Salvager", After a system crash, most of the contents of the 
storage hierarchy are’usually still intact and usable. Following 
an "emerzency shutdown"! Program, the Salvager can inspect the 
hierarchy, effect repairs to files which were being read or 
written, and make it unnecessary to perform a complete reload. 
For those cases when it is necessary <%o perform a complete 
reload, a variant of the incremental dumper is available to do 
Periodic complete dumping of the storage hierarchy. Thus 
reloading takes place on a per~segment basis instead of the 
per-storage-location basis implied by the entire-contents-dump 
approach. Reloading is therefore not dependent on the 
availability of the same hardware configuration that was used 
during dumping. 


PG; User Control, with the associated 
interprocess Communication (IPC) facility, is another major area 
in’ which there was considerable revision and functional 
expansion, User Control includes: 1) the Answering Service, 
which automatically responds to system dial-ups; 2) the login and 
logout modules; and 3) an automatic logout capability. Automatic 
logout is a particularly valuable feature; it allows the system 
load to be adjusted selectively and console sessions to he ended 
gracefully. IPC is important to numerous other areas of the 
System in addition to User Control, for it is the prescribed 


PAGE 26 


means of allowing processes to communicate with one another (see 
also "Jaemons", below). 1PC was thoroughly re-designed and 
re-coded. The new packaze is much more efficient than the 
previous version. 


Other Segments. Overhead in a Multics process can be 
considerably reduced if the number of segments it contains can be 
reduced (see Project MAC Progress Report V, 1967-1968, p. 39, 
regarding the desirability of hinding object-code segments 
together). File directories would occupy less storage space if 
they contained fewer seements. Therefore, a plan to combine the 
previously independent object-code, linkage and symbol segments 
resulting from the translation of a given source-code segment 
into a single "ohject segment" was put into limited practice 
toward the end of June. The necessary changes were made to the 
system for it to produce and manipulate object segments and, 
althoush no massive change-over has been’ scheduled, object 
semnents are propagating through the system as various modules 
are revised and recompiled. 


"Na3emons". A "daemon" process is a system (not a user) process, 
the operation of which is automatic. Several daemons have been 
developed during the year. Particularly helpful daemons 
installed were the "Output Driver Daemon" and the "EPL Daemon". 
The former accepts requests (via commands which employ the IPC 
facility) to perform card and printer 1/0. Such requests are 
then stacked on to queues so that the demand for peripherals is 
smoothed out over time. The latter makes possible queued 
on-line execution of the EPL (Early PL/1) compiler and the EPLBSA 
assembler, an important advance because these are the current 
syste: linplementation lanzuages. (A full version of PL/1 will 
replace EPL; see "Other Efforts", selow.) Execution is queued 
because of the drain on system resources which the current, 
interim languages represent. Thus, with only one 
conpilation/assembly in progress on the system at a_ given time, 
other users are able to perform other useful work without 
degradation of system response. 


Performance and Reliability 


idultics supported fewer than eight users for a few hours’ in 
September and more than 18 in 24-hour-per-day operation by June. 
The "more than 18" users involves two considerations: 


1) Because all the users are systerns programmers 
who use the machine very heavily, the somewhat 
arbitrary limitation of 18 users was established 
to keep system response from becoming too 
sluggish. (The system programming tools -- in 
particular the EPL compiler -- are not. so 
efficient as the programs that typical users would 
use.) 


2) The hardware "data switch", which allows 
consoles to dial in to the system, imposes a 
temporery physical limitation on the number of 
users. Thus, although precise figures of system 
load under typical usage conditions are not yet 
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available, the Over-al] state of system 
performance is quite encouraging, 


Performance improvements were effected in two ways. First was 
the application of Programming techniques of re-design and 
enlizhtened re-coding (for discussion, see Project MAC Progress 
Report V, p. 4&0). All the major functional areas discussed 
above (as well as the numerous. minor areas) were given the 
benefit of these Programming techniques. The most basic area was 
the File System. The pay-off was quite high In this area, with 
the time required to process all functions improved by factors 
ranging from two to four over the final version of the "old" File 
System. Preparation for the Yew File System effort included 
definition of a subset of EPL for system programming. This 
subset specified those constructs In the language that should be 
employed to obtain the most efficient object. This "Restricted 
EPL" subset was adooted for al] system programming except fer 
those user interfaces where adjustable-length character-string 
manipulation was unavoidable. Recourse to hand-coding to further 
improve performance remains necessary only in a very few modules. 


"Tuning" was the second major system improvement. Tuning jis 
primarily alteration of various System parameters to achieve 
better usage of resources, Among the system parameters tuned 
were the value of the "quantum" or time-slice allotted to each 
Process, the strategy of paging, the +rules of eligibility to 
compete for the available processor(s), and the scheduling 
algorithm, 


Members of the group developed and employed three tools to 
monitor performance as a basis for Planning the tuning. Work on 
a Master's thesis research Project involved initial development 
of a system on the PNDP-8 display computer which allows on-line 
Performance monitoring. A "Certifier", devz'loned as a Multics 
command, creates a set number of Processes, each of which 
executes commands from a file or "script" of typical commands. 
The Certifier allows comparison of the performance of successive 
systems, in both individual and over-all time consumption. 
Finally, a page-fault tracing tool allows deep analysis of paging 
behavior on a per-process or on a per-system basis. 


Reliability. Multics went into operation three hours per day in 
September 1968, The user community was small because of poor 
System reliability and generally limited access, For the next 
three months, very little real work was accomplished solely on 
Aultics. 


oy February, "mean time between failures" (hardware and software) 
had risen to two hours and the system was in operation about six 
hours per day. Users were beginning to exercise the System in a 
ay no automatic testing could. Software bugs occurred less 
often and were more difficult to find and correct. 


"Aultics in Operation" rapidly increased to 24 hours. per day 
during the week and over 12 hours per day on weekends. During 
the first week in April, Multics was in continuous operation 
without failure for over 30 hours hefore it was successfully 


shut down. Mean time between failure varied widely, with an 
average of about four hours, 


Since the beginnine of Way, Multics has been operating all the 
time the system could be kept "up", 24 hours per day, seven cays 
per week, Installation of the ‘ew File System increased 
performance to the point where 12 system programmers could use 
the system with reasonable respanse -- and as many as 19 users 
were logged in at one time in June. (The limit was set by the 
availability of Dataphone equipment.) During June, the averare 
number of users was between 10 and 12 for the hours 8:00 A.M. to 
inidnizht and between five and six for the hours midnitht to 8:00 
A.i4, Mean time between failures still varied considerably but 
average was somewhere over eizht hours. System usage per week is 
summarized in the adjacent fizure. 


Other Efforts 


Languaze development was the Primary area of work not directly 
related to the main. stream of development. Members of the 
Cambridge Information Systems Laboratory of General Electric 
worked on PL/1 and Fortran compilers. The PL/1 effort is 
Particularly interesting because, when it becomes available, it 
will be able to recompile system programs written in EPL. 
Indications are that PL/1 recompilation can lead to as much as 50 
per cent improvement in object-code length and to. further 
Performance improvement because of more efficient mechanisms for 


such things as character-string manipulation and 
structure-accessing. The compiler was also showing up well in 
terms of compilation time. ‘Whether PL/1 will be available when 


the system goes public jis uncertain; at worst it should be ready 
shortly thereafter, Fortran is expected initially to be the 
primary user languarze; it will be available in the tnitial public 
version of the system. Fortran has shown up well tn compilation 
time and in object-code efficiency. 


Another available languare, BCPL, is a result of work at Bell 
Telephone Laboratories, BCPL, implemented on CTSS by its 
Originator, Martin Richards, during his stay at Project MAC, is a 
language well-suited for the implementaticn of compilers. One 
compiler which has already been implemented in BCPL on Multics is 
PAL, used in an M,1I.T, undergraduate course in Programming 
linguistics. 


Yperations. During the tnitial stages of tegular Multics 
operation, it was necessary to have a programmer available 
throughout the session. The frequency of crashes or of other 
trouble, the lack of firm Procedures, and the necessity of 
catching every possible bug made it almost impossible to delegate 
the operation of Multics to the Computer Operations Staff, 


For several reasons, training operators in both the operational 
and recovery procedures for Multics was not simple. First, as fs 
to be expected in a research project, the "Operator Interface" 
was not zZiven great attention during the early stages of Hultics 
design. This area is now receiving programming support, Second, 
operational procedures chansed frequently as a consequence of 
system improvements. AS new modules replaced old, and as more 
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modules were added, operational procedures were changed to keep 
pace, and problems of communication resulted. Third, operational 
procedures remain numerous. ‘Jowever, this situation jis eased 
because particular emphasis is being placed on increased feedback 
and communication among operators, programmers and users. 


Successful operation of Multics requires that operators be aware 
of the condition of imany variables and be current In their 
understanding of procedures. Several classes and many pases of 
written inaterial have been provided to aid the Operations Staff. 
Nevertheless, this area will continue to receive attention since 
it critically affects the performance of the system as seen from 
the users' points of view. 


Wardware 


Introduction of production (as opposed to prototype) equioment 
was the major hardware development during the year. The 
necessary software changes went smoothly; no major problems were 
encountered during the change-over. There were, however, a few 
Processor bugs; the drum required more frequent cleaning than had 
been anticioated; and unexplained hardware transients caused 
trouble for approximately a two-week period before they 
disappeared, Generally, however, the production hardware has 
performed satisfactorily. 


Plans for the Future 


To open Multics for use to the general M.1.T. user community by 
Uetober 1969 is the main zoal. Plans have evolved to streamline 
those aspects of the system of most concern to the general user, 
with the aim of making simple features of the system very 
inexpensive in execution tiine while retaining the generality and 
flexibility for which the system has been designed. A "Limited 
Service System" (a subset of the present service system) has been 
defined to accomplish this. The command language and the 1/0 
System are the areas in which this system will differ most from 
the current system as used by the system programmers. The group 
has designed a "mini-Shell" that will process simple command 
lines very rapidly; the features of the full comnand language 
(i.e., the current Shell) will still be available to users who 
wish then: the mini-Shell will racoznize a oarticular character 
in the first position of a command line as an tndication to 
invoke the full Shell. In 1/0, the streamlining ts a redesign of 
the 1/0 switching complex to treat the currently desirnated 
default input and output streams as the oniy available streams 
unless the user takes action to define others, thus bypassing the 
switching machinery fn the "limited" case, The final major 
enterprise regarding the Limited Service System Its the definition 
and intensive streamlining and "polishine" of that subset of 
commands that constitutes the most frequently used sroup. 


With the Complete and Limited versions, Multics will meet the 
needs of both larre and small users. 


The concern of the Computer System Research Group with Multics 
will not end with the completion of the Limited Service System. 
Efforts will still be required to refine the hardcore system and 
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to provide additional system features. important areas of 
performance improvement inelute further tuning of the Traffic 
Controller, reduction of the number of semments (per process) 
required hy the supervisor, an{ selective pre-pagzineg when eivine 
control of a processor to a process. As the system soes into use 
by the community, new commands and subsystems will he develone4 
hy users and made avatlahle for seneral employment. 
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3, INTERACTIVE MANAGEMENT SYSTEMS 
MACALMS -- An Interactive Nanagement System 


The MACALNS project, inittated in June 1368, has two major foals: 
investigation of the value of a multiple-access computer, 
employing sophisticated interaction procedures, as an aid to 
Management; ani the specific apnlicetion of such procedures to 
the managenent of Project IAC. The work Is supported in part by 
ARPA funds through ONR and in part by ONR funds. 


B3ustIness applications occupy the e#reat majority o” computers 
installed in the Unite States. ‘owever, there has thus far heen 
surprisingly little use of Interactive comouting In management. 
The inability of most existing Interactive computing facilities 
to handte econonically che volume of data required for business 
purposes has been one reason for this. Another has been the 
requirement of most current systems that the user expend a 
simnificant amount of tine and effort to learn to use the 
systen. 


In the near future, some time-sharing systems will have creatly 
enhanced data-storar2 capabilities. Thus It will be technically 
feasible to use these systems In Interactive management. The 
question {ts whether systems can he devised that will permit the 
professional manaszer to Interact with the computer usefully and 
conveniently. This Is a question the MACAIMS project is asking. 


we spent the Summer months of 1968 studying the existing Project 


MAC management procedures. Robert Cas Goldstein surveyed 
available system-bullding tools. These studies suggested four 
applications areas for Initial lwplementat tons budgeting, 


purchasing, personnel, facilities. The Jast includes furniture 
and equipment as well as buildings and rons. We fecided to 
carry out the initial development work on the GTSS system because 
of its immediate avallabllity. However, we anticipate that other 
computers will be used in later phases of the project. All 
programing for the MACAIMS systein Is being done in the AED 
language (see Electronic Systems Laboratory section) which offers 
“oth extremely sophisticated capabilities and a= fairly hieh 
degree of machine indepen tence. 


After we studied the existing CTSS facilities for maninulat ing 
large data bases, we deci ied to create a system In which all data 
would at least appear to be resident in core at all times. This 
approach offers the maximum degree of flexibility for organizing 
the data, and it ereatly facilitates updating the data. Because 
the various application areas have data requirements that vary 
considerably In size = and structural complexity, we employ a 
variety of organizing stratezles, from simple sequential files to 
partially inverted list structures, 


The MACAIMS project Intends to develop systems that can be used 
by people engaged In management activities who have no_ special 
computer training. This Implies that the system should possess a 
rudimentary capability for natural languare conversation and, in 
particular, that it should recognize a wide range of 
business-related terminology. Therefore, before we could begin 
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programming of applications, we had to develop 93 a number of 
additional tools, prinarily in the areas of hishly interactive 
input~output, character-string manipulation and interpretation, 
and information retrieval. By late June 1969, Goldstein, Loren 
KK. Platzman, Neil tL. Ross, and Douglas M. Wells had essentlally 
complete’ the develonnent of these tools and had nade a start on 
substantive MACAIMS vorograms. 


We anticinate that [Initial working versions of some of the 
applications programs will be demonstrabte In the Fall of 1969. 
Our further work will emphasize four objectives: completion of 
the initial system; extension to other management areas; 
improvement in the method of Interaction with the user; and 
improvement in the efficiency of the vrozgrams. 


Interactive B3udgetinz System 


The interactive S3udzeting System (18S) was the focus of our 
recent research (Javid WW. Ness, App. A). wWe added a display 


capability that substantially improved the oricinal system. In 
demonstrations, we used the ARDS (Advanced Renote Display 
Q'ation) terminal at the Sloan School. Participants in the 


senior Executive Program at that School used the system in an 
experimental "game", developed with Professors Wallace B. S. 
Crowston and iichael S. Scott-Morton. 


135 involves a business inodel. We have continued to expand the 
model to cover wider areas of managerial interst. Currently, we 
are adding a costing factlity which incorporates Scott-Morton's 
work (Scott~Horton and Andrew J. McCosh, App. C). 


During the year, we also made a opreliminary study of the 
organization of binary trees frown with a sortine algorithm 
(William A. Martin and Ness, App. C). 


Econonies of Scale in Computer Use: Initial Tests and 
Implications for the Comnuter Utility -- Lee L. Selwyn 


This study concerned economies of scale in the production of 
data-processing and other computing services, and the possible 
regulatory and public-policy implications of these economies. 


The rapid development of the technology of computation since 
Norld War I! has raised many questions about the supervision. by 
public authorities of the use and progress of this technology. 
The Federal Communications Commission initiated a study in 1966 
to consider the Comunission's role in the production = and 
distribution of computing services that involve communications 
facilities, supplied by regulated carriers. The present 
investization concerns the production of computing services per 
Se. The direction that public policy takes will be greatly 
dependent on the nature of the production of computing services, 
and perhaps secondarily on the Interdependence between computer 
systems and the communications suppliers. 


The relative economies of the use of large computing systems have 
been known for some time, in terms of the relationship between 
some measure of the quantity of outnut of a machine and its cost. 
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Infeed, this study demonstrated that when one considers, in 
addition to the cost of the computer hardware itself, the various 
categories of operating expenses associated with a computer 
installation, the relative advantages of large facilities become 


even more significant. 


fet the evidence would seem to indicate that, despite these 
apparent efficiencies of large systems, most installed computers 
are fairly simall. tn an attempt to determine whether, in actual 
experience, there are no true economies of larze size, we made an 
analysis of data on nearly 10,000 computers installed at firms in 
manufacturing industries. Using the survival technique, which 
considers market experience as a basis for studying levels of 
optimum plant size, the analysis sussested that users did operate 
computers as if there were significant economies of scale fn 
their use. 


‘ione of the evitence suegested that even the larrest-size system 
available today Is the most efficient possible size of "plant", 
hence the key implication for the formulation of regulatory 
policy toward the comnuter is that such policy should encourage, 
to the greatest possibte extent, the shared use of larrme systems 
by those who require computing services. Those barriers that 
mitisate such shared suse should be reduced or. eliminated. 
Public-utility status would be indicated only {ff the costs 
associateil with shared-conputer use distribution <-- software 
development, system overhead and administration -- are less than 
the potential direct savings resulting from use of tarsze systems. 
This is at least as much a technological problem as it is 
regulatory one. The future of the computer-utility concept will 
thus be dependent upon the degree to which technology can reduce 
costs in these catezorics. 


SIMULA 


From Septenber 1968 until June 1969, Havard idegna, A Neorwegian 
visitor, was associated with the group. Ilegna had been invoived 
with the implementation of the SIMULA simulation system 
developed at the Norwegian Computine Center in Oslo. As an 
experiment, he implemented a version of SINULA on CTSS, using the 
AED ilacro system. 
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4, PROGRAHMING LINGUISTICS 
Introduction -- Robert WA. Graham 


Prozramming linguistics is concerned with formal definition of 
programming languages and with translation between programning 
lanzuages. Our definition of programming languages is a broad 
one. It includes languages ranging from nachine language to pure 
declarative or descriptive languages (i.e., lanzuagzes used to 
describe a set of functions and constraints, such as ae eset of 
differential equations and associated boundary conditions, rather 
chan an algorithm for the solution of a problem). 


The basic problem in defining a programning Janguage is the 
precise, complete specification of the language's syntax and 
seinantics. Several formal systems exist for defining 
programming languazes. Comparison of these definitional systems 
exposes hasic questions regardiiae thelr equivalence and 
hierarchical ordering in the classes of languages that they are 
capable of describing. Another bhasic question concerns” the 
rquivalence of programs that are written in the same language. 


Jiscovery of a translation algorithm that uses directly the 
formal definition of either or both of the two languages 
involved is a hasic »rohlem in the construction of programming 
language translations. A "table driven" comniler is such a 
translator, Present translators of this type allow the 
implementer to specify hts translation algorithm, using a 
special-purpose language ‘or languages). At present, only syntax 
analysis can be done automatically. Parsing algorithms can be 
zenerated automatically, ziven a formal description of the syntax 
of a language. <A general translation algorithm would accept the 
definitions of two languages and synthesize « translator between 
the two languages. 


Professor John J. Donovan and his students Investigated problems 
in the formal definition of programming languages, using canonic 
systems, models of file systems, and comprehensive operating 
systems. Professor Arthur Evans, Jr. and his students studied 
(with Prof. John 4. Wozencraft), probiems of defining orogramming 
languages, using lambda-calculus, extensible languages, compiler 
theory and construction, and the PAL language. Professor Robert 
4. Grahan and his students investigated the probtems of a 
compiler specification system and of a software system design 
languase. 


Educational Activities -- Rohert HM. Graham 


‘embers of tne Proszramming Linzuistics Group continue to take an 
active role in course development at M.I.T. Althourh we shall 
not do so in connection with every group report, we nay digress 
briefly to say a word about that ork. 


Evans and Wozencraft are in charge of 6,234 (Electrical 
Engineering, Programming Linguistics) which ts the first of a 
three-course core sequence In Computer Sclence. Approximecely 79 
students took 6.231 last year. 
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Graham and Donovan are In charge of 6.251 (Electrical 
inzineering, Dizital Computer Programming Systems), the basic 
course in computer software (covering assemblers, loaders, 
compilers, and time-sharing systems). This course, although it 
is mot required in the Computer Science curriculum, has an 
enrollment of over 300 students each = year. The course is 
continually being updated to reflect the latest software 
developments, such as the Multics system. 


In the Sprine term, Donovan tnitiatei a new course, 6,686 
(Electrical Enzineertng, Software Projects Laboratory). The 
intent of this course ts to sive students an opportunity to 
participate in the desian, implementation and management of 
software projects that are significantly more complex than any 
project that can he assianed in the standard courses. The course 
was a great success, enthustastically received hy the students. 
The central theme of the course was the design and implementation 
of a comprehensive operating system for the 1130 computer. 
Approximately 35 students oarticipated in this effort. 


Grahain completed the tnitial planninre for a new course, 6.531 

(Electrical Engineerinz, Principles of Programming Language 

Processors), which will be offered for the first time in the Fall 

of 1969. This course will cover advanced topics tn the formal 
specification of programming languages, construction of 

programming language processors (principally compilers), and the 

application of profranming linguistic theory to the construction 
of compilers. Students will be given the opportunity to build 

language processors, using a compiler specification system 
currently under development. , 


Software System Desizn Language -- Robert M. Graham 


The inherent complexity of many current software systems is 
greatly magnified by the unavailabitity of any high-level 
language matched to the problem of the _ specification and 
description of software systems. The language should he such 
that the description of a system contains enough information that 
analysis and simulation of the system performance are 
straithtforward and effective. We hegan in the Spring to study 
this problem. Our initial attack is the attempt to discover what 
primitives are really basic and common to all types of software 
systems (e.2., compller-bullding systems as well as time-sharing 
systems). For example, table management seems to appear in every 
system and even in single software components such as compilers, 
assemblers and loaders. Should one then include, in a_e system 
specification language, facilities for describing tables and 
their management (i.e., specification of the contents of an 
entry, definition of the keys and search rules, and defirition of 
the operations such as modification and deletion, desired for 
each key)? Perhaps such facilities are too high-level and 
specific and the solution instead is more powerful abstract 
structure manipulations. 


Javid D. Clark, in hts Doctoral thesis, ts currently examining 
those aspects of a system that can he expressed as 
transformations between different address spaces. A large part 
of the Multics file system is concerned with mapping symbolic 
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segiwent (file) nanes into segment addresses usable by the 
hardware. This address mapping was the subject in a sequence of 
lectures Graham gave in Summer courses at the University of 
Atchigan and at the Hebrew University. 


Graham also discussed use of high-level language for system 
Programming and specification at the NATO Working Conference on 
Software Engineering and at the Honeywell Second Annual Software 
Symposium, 


Students also studied problems related to this project. Caria ‘. 
Vort investigated the description and on-line matntenance of a 
computer system menory. Robert L. 3ushkoff, in his Bachelor's 
thesis work, used a comniler specification system, currently 
being implemented, which specified the parsing, intermediate 
lanzuage, anid symbol tables for a CIMPL compiler. CIHPL is a 
descendant of PL/1 destened by Graham and Prof. Jerome fH. 
Saltzer; it is used tn 6.233 (Electrical Ene ineering, 
Prozramaing Linguistics) for system pnromramnineg. Jeffrey R, 
Spirn's Sachelor's thesis was "A Simulation Motel of a §Larse 
Multi-Processing Operating System", Wichael J. Greata, for his 
Master's thesis, desizned and implemented, on the PDP-8, a 
deburzting aid which dynamically displays the sequence of 
subroutine calls made during the execution of a program. In 
addition, the system permits the user to stop the action and 
display the values of selected variables in the program. 


A Compiler Specification System -- Robert M, Graham 


In) June, we began a project to implement a new compiler 
specification systen. The system is being inplemented in 
‘tultics with the aid of 3CPL (Basic fombined Programning 
Language). Use of BCPL makes the syste:: casily transferable to 
other computers. (3CPL compilers exist for at jeast seven. other 
computers, including the 360.) liowever, we feel that a 
satisfactory solution to the problem of snectfication of code 
generation tables -- especially code optimization -- resuires an 
on-line interactive environment. 


The initial system design consists of processors for three 
specification languages: resular expressions for the 
specification of lexical analysis; reductions for the 
specification of the parsing and interpretation of the parse; and 
a imacro language for the specification of code eeneration. The 
reductions languase is  hased on earlier work by David 1D. Clark 
(Aon. A). A unique feature of the reductions language processor 
is that it compiles the reductions rather than Interpreting them. 
The reductions are compiled into resular BCPL programs which are 
then compiled by the standard compiler. This permits a system 
user to write action routines (for tnterpreting the parse) in 
BCPL and to merze then easily with the reductions. 


Canonic Systems -- John J Donovan 


A canonic system is a simultaneous recursive definition of sets 
of strings on a finite alphabet. Canonic systems can he used as 
a formalism for specifying both the syntax of a programming 
language and its translation. Canonic systems are sisnificantly 
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more Powerful than the traditional Backus-Naur Form (BNF) 
formalism, hecause the latter is unable to specify 
context-sensitive features. By restricting the canons of the 
systems, a hierarchy of less powerful canonic systems can he 
defined which include correspondents for some of Chomsky's types 
of formal grammars. 


In a Master's thesis research, Robert Mandl has developed = an 
alternate method for presenting the theory of canonic systems. 
In addition, he has developed a new hierarchy of canonic systems 
which relates general canonic systems to all. four types of 
formal grammars defined by Chomsky. ‘fe has also shown that. all 
attempts to define a mathematical system that exactly corresponds 
to the recursive sets must fail, 


uiven a canonic system, C, it Is possible to sfenerate another 
canonic system, C,,, which its a proof measure function that is an 
indication of the complexity of the language defined by C. One 
can also generate a conpanion = system which characterizes the 
recognition of strings generated by C. Joseph P. Hagrerty, in a 
Waster's thesis (App. A), was able to show that algebraic 
bounds on C@can be derived from the structure of the system ( 
d@ also established a relationship between the complexity of the 
recognition procedure and the complexity of the language 
description. 


Hoo-i4in DD. Toong has develonved a means for formalization of 
discrete-event simulation languages using the lamhda-calculus and 
graph theory (App. A). Through anpnlication of lambda-operators 
to the graph theoretic representation of data structures, one can 
derive hasic properties of systems modeled by these data 
structures, 


File Systems -- John J. Donovan 


During his thesis research, Stuart E. Madnick developed a model 
for file systems which fsolates the basic function of file 
systems (App. A). This model is useful for comparing various 
implementations of file systems and in judging various design 
Proposals. 


1130 System -- John J. Donovan 


In order to give students the opportunity to deal with the 
Pproplems one encounters in the design and implementation of a 
comprehensive operating system, we began a project to design and 
implement a comprehensive operating system for the 1130. The 
system also provides a test bed for new ideas in compiler 
implementation. The basis of this project is student 
Participation; the design and implementation are done entirely 
by students. Several Master's theses have grown out of this 
Project. 


Clifford R. Yollander's Master's thesis (App. <A) was on. the 
design of a multi-tasking feature which not only provides for a 
mixture of batch-processing and interactive tasks but also 
accommodates devices such as graphic displays and communications 
facilities. Charles A. Dancy is working on a COB0L conniler for 
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his Master's thesis, and Leon E. Travis on a COBOL interpreter 
for his Master's thesis. 


Programning Linguistics -- Arthur Evans, Jr. 


Completion of the notes for 6.231 (Electrical Engineering, 
Prozramning Linguistics) gave closure to a considerable amount of 
past conceptual research. Our understandinz of the Imperative 
constructs in PAL had finaliy reached the point where we could 
write the chapters on assisnment'§ statements and on transfer 
control. We expect to give a two-week M.I.T. special Surmer 
Session in July which will cover all the programming linguistics 
material. This will be the’ first major presentation of this 
material to a non-M.I.T. audience, 


In connection with the Summer course, we have implemented. the 
language PAL (which plays an important role in the teaching of 
this material) on Multices. The students will use PAL to do 
exercises on the computer. 


Extenstble Languages -- Arthur Evans, Jr. 


Support for our research in extensible languages comes in part 
From IASA's Electronic Research Center and in part from ARPA 
through ONR, Robert 4. Thomas Is working on a Ph.D. thesis in 
the area of syntactic extensions to progranming languages. J. 
Vix Fulton did an imbedding of SLIP (Symmetric LISP Processing) 
in BCP for his Master's thesis. This Imbedding is untque; it 
was done entirely in the hither languaze, with no assembly code 
required at all. To illustrate this point, the same 
implementation was demonstrated as workable on hoth CTSS and the 
360, with changes only to the data-describing declarations. 
Because the machines have different word lenzths, this ts a 
significant accomplishment. 


Conpiler Theory and Construction -=- Arthur Evans, Jr. 


Franklin Lb. DeRemer, tn the abstract to his Doctoral 
dissertation "Practical Translators for LR(k) Languases" (App. 
A), states: 


A context-free syntactical translator (CFST) Is a 
machine which defines a translation from. one 
context-free language to another. A\ transduction 
grammar is a formal system hased on a context-free 
grammar and it specifies a context-free 
syntactical translation. A simple suffix 
transduction 7rammar hased on a context-free 
Zrammar which is LR(k) specifies a translation 
which can be defined by a deterministic push-down 
automation ()DPDA). 


| present a method for automatically constructing 
CF3Ts (DPDAs) from those simple suffix 
tranduction grammars which are based on the LR(k) 
gwramnars. The method is developed by first 
considering grammatical analysis from the 
string-manipulation viewpoint, then converting 
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the resulting string-manipulatine algorithms to 
DPDAs, and finally considering translation from 
the automata-theoretic viewpoint. 


The results are relevant to the automatic 
construction of compilers from formal 
specifications of prorramming lanmuagces. If the 
specifications are, at least in part, bhased on 
LR(k) grammars, then corresponding compilers can 
be constructed which are, in part, based on 
CFSTs. 


Joseph W. Slater completed a Master's thesis on GENRAP, a system 
for generating automatically a parsing adgorithn from a BNF 
description of a language. The inethod used was copied from a 
scheme of Cheatham's. 


The BCPL Language -- Arthur Evans, Jr. 


We brought the implementation of 8CPL on the J3Hh 360 at the 
information Processing Center to a consistent state so that the 
language could be exported to other installations. We expect to 
send out tapes during the Surnmer. 


We are assuming responsibility for maintenance of the BCPL 
language on Multics. We nlan to improve the Interface between 
BCPL and the rest of the Multics environment to make it easier to 
write and run 3CPL programs on Itultics. We also hope to make 
significant improvements in the efficiency of the code compiled 
by the compiler. Because the compiler is written In 8CPL, this 
will also improve the efficiency of the compiler. 


The PAL Language -- Arthur Evans, Jr. 


PAL has heen inpiemented on Multics. Because PAL is written in 
BCPL and the Multics version of BCPL is compatible with that on 
CTSS, the entire effort took only about three man-months of 
work. 


Marcus C. Bortue, in his Master's thesis research, completed and 
fully documented a student operating system on the 360. The 
system runs as a subsystem under 0S/360, batching many = student 
PAL jobs and making a sianificant saving of computer time. 
Although we have neither made nor contemplated any major changes 
in the PAL language, the Master's thesis of Stephen . Zilles is 
a significant effort toward extension of PAL's data-description 
abilities. Results of this work may ultimately influence PAL. 
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5. THEORY OF AUTONATA 


Professor Chung L. Liu and his students studied several problems 
in the general area of abstract machine theory during the past 
year, 


One of these was the algebraic structural theory of finite state 
machines. A finite state machine, viewed as an Information 
transducer, can be characterized by lattice functions describing 
the relationship hetween the supply and demand of information at 
its input and output’ terminals. Using the lattice function 
characterization, they were able (generalizing the definition In 
the literature) to define the notion of vair algebras. 


The second problem was the structural properties of a class of 
probabilistic finite state machines, known as definite machines. 
Ltu's group established necessary and/or sufficient conditions 
for testing the definiteness of probabilistic finite state 
machines. 


finally, Liu's sroup was able to establish a synthesis procedure 
for a mininal nonlinear shift rezister which generates ai given 
output sequence. (This problen is a efeneralization of the 
problem of synthesizing a minimal linear shift register.) 


Professor Zvi S. Kohavi and his students investigated the problen 
of detecting and diagnosing fallures in combinatorial losic 
circuits. They developed a procedure to determine "nearly 
minimal" sets of tests which detect single failures and locate 
them to within an equivalence class without resorting to a fault 
table. 
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6. ELECTRONIC SYSTEMS LASORATORY 


introduction 


The participation of the Electronic Systems Laboratory in Project 
MAC, begun in 1963, continued this year. Graphics Research 
(reported at the end of this section) and two other ESL projects 
had close working tiles with, and received partial support from, 
Project AC. One of these, the Comnuter-Aided Design Project 
(CANP) for the UES, Air Force Materials Laboratory, 
Wrizht-Patterson ‘Air Force Base, has developed the AED laneuage 
and systems. The other, Prof. Michael £. Dertouzos's prolect 
for National Aeronautics and Space Administration (Electronics 
Research Center), worked with on-line simulation of networks and 
systems. Both projects used the Project MAC computer facilities 
and received direct Project MAC support of facilities and/or of | 
graduate students. 


The AED Lanzuaze 


The iW.1.T. Computer-Aided Desizn Project has been engaged since 
i950 in research on the application of the concepts and 
techniques of modern data processing to the desian of mechanical 
parts, as an extension of automatic programming APT (Automatic 
Programming Toci) systems for numerically controlled machine 
tools. Whereas part-programming [ts a relatively bounded domatn 
which permits a single, standard APT program and language, the 
problem of desiagning larse systems such as an aircraft is so 
complex that no one design program or language can be constructed 
that will serve all the varied needs. A very large number of 
desimn languages and prorrams jis required, each tallored to a 
specific aspect of the over-all desien process. Since, if 
traditional methods were used, the time and effort needed to 
construct each specific language and promram and to make it 
available on computers of various types could equal that of the 
entire APT development, the project's major effort for the past 
several years has been the development of technicues for 
automating as much as possible of the process of constructing 
specialized languages and programs, and development of the 
process of moving programs from one computer to another. The 
result [is the AED (Automated Engineering Desizn) family cof 
programming systems including: first, the AED-1 system, whose 
domain is general programming, compiling and operating of 
programs on essentially any large-scale computer; second, the 
RWORD System, which builds a lexical processor; third, the AEDUR 
System, which builds a parsing processor; and fourth, the CADET ; 
(Computer-Aided Design Experimental [ranslator) System, aimed at 

a generalized approach to computer-aided desizn applications. 


Major emphasis last year (Project MAC Progress Report V, p. 49) 
was on the subject of machine independence and the process 
required to convert the AED system programs to new and basically 
different computers through "bootstrapping". The bootstrapping 
procedure (described in detail in Project MAC Progress Report V) 
resulted in preliminary releases of AED for the [BM 360 and 
Univac 1108 computers in the Spring of 1968, 
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These systems were incomoi ete and differed not only from each 
other, but also from the }.1.T. 7094 system from which they were 
derived. The group's major effort during the past year has been 
a complete reworking of the AED system packages and bootstrapping 
procedures to produce a new, fully releasable Version 3 AED-1 
system for the IBM 360 In both OS (Qperating System -- batch) and 
CP/CMS (Control Program/Cambridge Monitor System -- time-sharing) 
versions. The Version 3 release is described in the next 
section. At the same time, the system was’ re-bootstrapped back 
to the 4.1.7. CTSS on the 13M 7994, on which it was developed, so 
that all campiler versions on both machines would be _ identical. 
(The 7094 re-bootstrap is completed, but It has not yet been made 
operational. It, like other Version 3. releases, produces 
assembly language output, which requires a further assembly pass. 
Since the older binary output (Version 2) AED on the 7094 Is more 
efficient, It is still being used. Version 3 hootstraps 
compatible to two other machines -- the Univac 1108 and the GE 
645 (Multics) <-- were vartially completed, and are described in 
the second section below. The major effort made to improve AED 
documentation is described in the third section helow. 


‘ith the release of the AEN 260 system, hoth the Air Force 
fundine for the Computer-Alded Desian Project in ESL and the ARPA 
funding (through ONR) of AED work in Project MAC terminate. 
Douglas T. Ross and his key staff members leave M.I.T. in July to 
form a private company. 


The AED/360 Release 


This section is a summary, mainly in AED terms, of what 
constitutes the AEN/360 system as released. Version 3 of AEN-1 
for 3M 360 computers consists of the AED-1 Compiler for the 
AED-0 Language, the AENDJR System for language definition, and the 
AED Library of system-building packages. (The RWORD System for 
setting up lexical processors, which was available In earlier 
7094 and Univac 1108 releases, was not Included In the July 1969 
Version 3 release.) The release Is on four reels of magnetic 
computer tape. The four tapes are divided Into the following 
lorical categories, with one tape for 0S/360 users, two tapes for 
CP/CIS users, and one tape for both OS and CP/CMS users: 


Tape No. Contents  j. $$ Qperating System 
1 Source Programs 0S/360 and CP/CMS 
2 Partitioned Data Sets 0S/360 
for Runnins AED 
3 Modules and Libraries CP/CMS 
for Running AED 
4 Text Files CP/CMS 


Tape 1 contains the 540 source programs for the AEN-1 Compiler, 
the AENJIR System, and the AEN Subroutine Librartes. The tape 
also includes two simple test cases, one for the compiler and one 
for AENJR, toe check that no malfunctions (e.2%., bad tapes) have 
occurred In the cody process. The total tape contains 
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approximately 64,000 card equivalents of EBCDIC (Extended 
ginary-Coded Decimal {nterchanzge Code) source data written 
primarily in the AED-0 Language, with a few 360 Assembly 
Language, AEDJR, and RWORD System input-langzuage programs. 


Tape 2 contains the OS modules and libraries required to run AED 
on the 360 under OS. All modules are designed to work under PCP 
(Primary Control Program), MFT (Multiprogramming -- Eixed number 
of Tasks), and MYT (Multiprogramming -- Variable number of 
Tasks), but not under DOS (Disk Qperatine System). The eight 
files include: AED-1 Compiler, Basic Af) Text Library, "Fast 
Free" Text Library, AEDUR System, AENDJR Text Library, AED-1 Macro 
Pass, TRACE Debugzing Text Library and AEN-1 Text Library. 


Tape 3 contains the MODULE, TXTLI3, and MACLIB files of the 
CP/CHMS version of AED. The tape is divided into four files: the 
AED-1 Compiier and its run-time support libraries; the AEDJR 
System; the AED-1 Nacro Preprocessor; and the OSTOCHS (0S-to-CilS) 
utility, which permits the OS Source tape contents to be read 
selectively into the user's file directory. 


Tape 4 contains all text (object) files for the AED system; it is 
desiered for CP/CHS only. Files include the AEFUJR System and its 
libraries; the Macro Pass; all AED-1 Gmpiler programs; the 
"East Free" Packaze; and the TRACE Debugzing Package. The tape 
also includes the SHOWIT AEDJR Example, described in the fourth 
section below. 


Documentation of AED 


Ross and other staff members are preparing, for publication in 
the Fall of 1369, an 2 
Enzineering with the AED-0 Language, a hnok based on a _ lecture 
series they Rave in the M.t.T. leetrteal Engineering 
Nepartment's course 6.687, Software Engineering. 


At the same time, Clarence G. Feldinann has completely revised and 
reworked the AED Programmer's Guide. This has been a loose-leaf 
collection of memoranda and system “flashes", but with this 
reworking it now conforms to the standards of the new Version 3 
AED release. 


By the end of June, hoth books were ahout 80 per cent completed, 
and both were scheduled to be finished by early September. 


The SHOWIT Systen: An Example of the Use of the AED 
Approach 


From the beginning, the CADP group has viewed “design” as only a 
special term for a type of problem-solving. However, the field 
of man-machine problem-solving is too broad to permit a_ single 
system to be used for all applications. Wany systems are needed, 
and each must: 


1) Use the specialized jargon of its particular 
field of application; 


2) Require little or no knowledge of computer 
programming to be used effectively; 


3) Be “evolutionary" to adapt to the changing 
needs of its users; and 


4) Be created and maintained by the users 
themselves or by skilled staff in intimate contact 
with the users. 


For these reasons, the project's efforts have been directed 
toward a system for making systems rather than a single 
computer-aided design system. What has evolved is a "system of 
systems for making systems" and an orderly method for applying 
it. The CADP group refers to this collection of concepts and 
working tools as the "AED approach". 


The SHOWIT System, which originated as an unscheduled 
demonstration of AED capabilities at the Second AED Technica} 
Meeting in January 1967, is a tutorial example of the application 
of the AED approach, with particular emphasis on the facilities 
of the \EDJR parsing for implementing new languages. The SHOWIT 
lanzuase is a subset of the Iverson language, chosen during the 
January 1967 AED Meetine. 


3y using AEDJR as the framework for the new = system, the 
programmer can implement quickly ali the procedures and erammar 
rules. In particular, the initial plateau of SHOWIT acts as a 
subsystem of AEDUR. By means of three successive commands to 
AEDJR, one can cause the following: 


1) Reading in and making active the grammar rules 
for the entire SHOWIT language; 


2) Invoking a specitai set-up procedure, written by 
the programmer; and 


3) Executing any specific program, written in the 
SHOWET lcaguaze. 


The lexical processor for the SHOWIT system is a specially 
constructed RWORD machine. Its item-building rules conform to 
the designer's specifications for the SHOWIT language. In 
particular, the lexical phase identifies and discards comments 
written by a user In his input message. The RWORD machine is 
invoked -- that is, a request is made for a new system to be 
extracted from the input. stream -- by a special procedure the 
programmer writes to replace the standard one provided in AEDJR. 
The programmer's procedure is called by the First-Pass Algorithm 
each time a new input item is needed to continue the parse. A 
second RWORD machine is used in the SHOWIT system to read in data 
values which a user types on-line. This machine accepts numeric 
items written in Integer, decimal, or E-type format. 


The SHOWIT system has been documented by John R. Ross and Douelas 
T. Ross, ‘n "The SHOWIT System: An Example of the Use of the AED 
Approach", ESL-TMN-394, June 1969. 
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Syntax Definition Facility 


Robert S. Eanes, in his Master's thesis research (App. A), 
studied the language-definition problem. dis work resulted in a 
system called the Syntax Definition Facility (SDF). This is an 
Nteractive system that allows. the designer of a computer 
Programaing language to define the syntax of his language ina 
relatively simple natural meta-language. A set of tables for 
deriving a general parsing algorithm Is produced from this syntax 
definition. If the system detects possible ambiguities or 
inconsistencies in the definition supplied hy the languare 
desiener, it will report them and try to indicate the source of 
the problem. The system Includes test and debugeineg facilities 
to aid the languaze designer. 


The SDF meta-language allows the langzuame designer to specify the 
syntax of his language by writing a series of Sample statements 
which are marked to indicate how they should he parsed. Each 
sample statement specifies the "kind of value" or Sel 

of a construction in the language. By the underlying principle 
of i , which allows any construction to be 
s'bstituted for any other construction of the same semantic type, 
the smal} number of sample statements induces a complete language 
definition, allowing statements of arbitrary size and complexity. 
A syntax definitton in the SDF meta~language is somewhat similar 
to a Sackus-Naur Form of context-free grammar definition, but it 
is more readable and easier to Produce. The algorithm used by 
the system to produce a parser from the meta-language description 
is a synthestIs of the precedence techniques and the AED Language 
definition systems. The thesis will be Issued as an ESL report. 


Un-Line Simulation of Networks and Systems 


The effective use of on-line computer utllities i. the design of 
electrical networks and systems is the main objective of this 
research. This includes studies In the mathematical foundations 
of computer-orlented network and System analysis ard in the 
interactive features essential for the design of networks and 
systems. 


Work In these areas has proceeded along several directions. 
Charles W. Therrien and ‘uber L. Graham completed their doctora) 
dissertations: the former on network tearing, the latter ona 
new recursive approach for computer analysis of non-linear 
networks (App. A). The sroup's work on CIRCAL-I1, 9 a 
feneral-purpose network-analysis Program, continued with the 
development of: a "nseudo user" (a user-defined prozram for the 
automatic optimization of networks); a method for dynamically 
loading and unloading program sections; tmplementation of 
non-linear transient, symbolic-frequency, linear-time and 
recursive-analysis techniques for CIRCAL~!1; and development of a 
general-function and functional capability for CIRCAL-I1. In 
computer-aided system desizn, the group completed the initial 
version of LOTUS, a block-diagram analyzer. It also initiated 
implicit computation = and cont inuous-discrete systems, with the 
objective of developing effective comput ing techniques for the 
solution of linear and non-linear systems of equations. 
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Tearing of Networks 


Therrien has continued work on the use of "tearing" for reducing 
computation and for expediting the computer solution of 


electrical networks. tis thests discusses the question, "When 
and how should a network be torn in order to minimize’ the 
computation necessary for solution of that network?" He has 


provided several answers to this question. They involve use of a 
model for tearing, several deterministic tearing algorithms, and 
one statistical algorithn which achieves statistically 
predictable computational savings over a class of networks. 


A Recurstve Approach to Network Analysis 


Graham describes in his thesis a new technique for the direct 
analysis of non-linear networks through recursive decomposition 
of their network graphs. This approach differs from conventional 
approaches that use iteration techniques. Using the new 
technique, one constructs 4 non-linear function relating the 
given excitation to the required response variables. The 
construction of this function is recursive, and it is in 
one-to-one correspondence with the steps leading to recursive 
decomposition of the network graph. 


CIRCAL<-!! System Developments 


The CIRCAL=-!1 program is a general-purpose, on-line, 
circuit-design program. (For detatled discussion, see Project 
MAC Progress Report V, p. 60.) The matin program, which hecame 
operational last year, performs all the necessary "overhead" 
tasks in computer-aided network analysis; it has convenient 
"olugs" into which various analysis techniques can be connected 
to enlarse its capabilities. There have seen four developments 
in the main system of CIRCAL-11 during the year: 


1) A preliminary investigation and implementation of 
the pseudo-user or Defined-Command Feature. This 
feature substitutes for the real user a user-spec!*'ed 
profzram which "observes" analysis results and ivakes 
appropriate modifications to the network se as_ to 
optimize performance. This system feature has been 
used, for example, to design an oscillator circuit by 
automatically adjusting circult parameters so that the 
oscillation will be at a pre-specified frequency. 


2) A method for controlling the dynamic loading and 
unloading of program sections in order to maximize 
available storage. 


3) Extension of the definitional capabilities of 
CIRCAL-!t, including the results of (1) above. This 
includes a general ized capability for defining 
functtons and functionals. Functions are used in 
CIRCAL-I!! for non-linear element characteristics, for 
source waveforms, and for “post-processing” of computed 
network variables. Functlionals are used in handling 
hysteresis, thermal effects, and other phenomena that 
exhibit memory. 
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4) Prenaration of CURCAL=tt ousers' and programmers! 
manuals, 


Yew CERCAL-I1 Analysis Techniques 


Duriny the nast year, the croun hezan develoonment of several new 
analysts techniques, The correspon ling tasks involve 
implementation of: first, the recursive-analysis technique, the 
theory of which has hean develoned, as discussed ahove; second, a 
symbolic frequency-analysis technique for larce, linear, sparse 
networks; third, linear-tine techniques; fourth, non-linear 
transient-analysis techniques. The symholic-frequency analysis 
is of the so-calle1 conniler type, which reaqnires little 
execution time and is especially suited to larme, sparse 
networks, 


Connuter-Aided System Desian 


The troup's work in connuter-al te System desien concentrated on 
three main areas: 1) completion of the fevelonment of LOTUS, a 
protran for the on-line simulation of hlock-iagram systems; 2) 
synthesis of continuous systems; and 3) implicit conputation. 


The latter involves research on synthesis tools similar to those 
used in the synthesis of discrete finite-state machines. It 
entails a new anproach to the orvanization and structure of 
co wutine hardware, capahle of solviny implicit alrebraic and 
differential equations, 


Sraphics Research 


The ESL Misnplay Groun continued its research on hardware and 
software techniques for eranhic interaction in a time-sharine 
environment, In past years, with sunnort fron the Air 
Force-sponsored Cowuter-\i ded Vesitn Project and Project AAC, 
the troun daveloned the ESL Nisnolay, a sranhics disnlay with 
snecial SYartware canahilities for Nicture rotation and scaling, 
an’ the ARIYS (Advanced Remote Disnlay Station), a storare-tube 
zranhiecs terninal for remote oxeration via a telenhone-line 
connection, The «roun's work this year dealt with renote 
oneration of the =SL  Visnlay Console, connuter interfaces to 
support additional ARDS units, a 4ard-cony facility for APNS, and 
sraphics-supnport software, 


FSL Console 


uring the year, the Project MAC 7094 comouter was# moved to the 
(4.1.7. Information Processing fenter, and the former !PC 7034 was 
retired, Since the present Implenentation of the ESL \isplay and 
its P?DP-7 huffer computer requires direct connection to a 7094 
data channel, they also were moved fron Project MAC and made 
operational in the new location. At the same tine, the second 
ESL Display and PDP=3 buffer computer, which were to have heen 
usef remotely in ESL, were moved to Project MAC to maintain 
zraphics capahility there. In late Way, the 50-kilobit teleohone 
connection hetween the PP-7) and POP-9 was installed, and 
check-out of the link and associated communications software 
Seaman, By the end of lune, communications hetween the P9P-9 and 
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P9P-7 were estahlished, ani the remainder of the orotram modules 
for remote-display onerationr were heine checked out. Study 
continues on means for automatic start-up from the remote 
location, This remote start-uo requires that hoth the PNP-9 and 
29P-7 he running and that they contain at least the bootstrap 
cormunications modlules for orotram loading from the 7094, 


Juring the Sprinsz, a Sylvania Data Tablet was incorporated into 
ane of the two consoles of the ©SL Display at Project AAC. An 
interface was constructed to o%ermit readin= the tablet output 
firectly into the PIP-9 via input-output transfer instructions. 
\ character-recotnition oprotran, which will be the first use of 
the tablet, is heins written. 


Eitht ARDS units are now used, orimarily with the CTSS 7094 
comouter. That number nosed a oroblem, a shortazte of computer 
ports. The 7750 communications interface of the 7094 will 
support a total of four "hith-sneei"™ (1200 hos) lines; only two 
such lines were operational for \ROS use at the start of the 


year. An additional voort, imnlemente! during the year, hroumht 
the total to three; and there was onrorvress in convertine§ the 
yoresent dedicated use of the renainine Nort to ARIS 


compatibility. 


Looking beyond this four-nort canahility is ai study of an 
information concentrator for the 7094 which would orovide up to 
15 ARS onorts, olus SO-kilohit connections to the two ESL 
Displays ani oerhans to the Interface Messacte Processor of the 
ARPA comnuter network, The concentrator would be a-small 
computer interfacei to the 709% OYfrect Nata YMaviern (ND), 
yrovidine 36-hit word transfers at un to 170,990 words per 
second, The PNP-7 buffer computer for the SSL Display, which 
oresently occupies the 999 interface, would then be connected to 
a 59-kilobit serial port of the concentrator. Several smatl 
computers have heen evaluate! for use as concentrators, The 
general system conficturation and buffer nemnory requirenents have 
heen determined, 


The System/360 data adapters for USASCI! (USA Standard Code for 
Information Interchanse) interfaces oanerate only at Teletype 
spee! (110 bns). Thus, at M.1.T. those ‘ARISs used with the 
360/67 computer must operate at this low speed. The ctroun has 
assisted the IPC to plan for alleviation of this problen = by 
acquiring a snecial fatar~adanter unit to connect to a 360 
multinlexer and to provide a number of 1200-h9s (or hicher) ARNS 
Ports. 


The successful overation of an \ROS for two weeks, 22 ANutust to 3 
September 1958, rat the M.I.T./Technical University of Serlin 
Conference, was a Hithlivtht of the year. International 
cooperation estahlished a 1200-hns circuit via a dial-up 
connection fron Berlin to Frankfurt with an ITT atatel 
eonnection to ‘lew York, anda dedicated AT*%T line from "lew York 
to the Project MVC 7094, There were several onerators to deal 
with, not all of whom spoke = Enelish. Except for oroblens in 
re-establishing the circuit each time it was to he usel, however, 
oneration was trouhle-free over many hours of use, 
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ARNS Yard-Conoy Nevice 


The experimental hard-cony station ‘discussed in Project MAC 
Provtress Report V was assembled and tested. This station was 
desitned by Albert Vezza. {t consists of: a Tektronix 611 
storate monitor driven by an ARDS Aisnlay; a modified 3h 
try-silver paper print unit; a lens to image the display screen 
at 1:1 on the recording paper; and aio shutter mechanism. 
Modifications made to the orint unit increased the tempnerature of 
the heated developnent cylinder to process photozraphically a new 
Xotak dry-silver-process paper that has a photographic speed 
eitht to twelve times faster than the paper for which the print 
unit was desitned. With that paper, heating constitutes” the 
entire developnrent process. The new Kolak paper is first 
stabilized by heating to a temperature of 240 desrees; it is 
then exposed to ultraviolet litht to bring out a visible imaze. 
Total developinent time is 10 seconds. 


With the increase 1 speed available in the Kolak paper, it’ is 
possible to make zood-density »orints with five-second exposures 
from the storame-tube screen, comnared to the 30-50 seconds 
required for the old paner. The wide-antle-to-wide-anzle lens 
used to obtain 1:1 imatine of the 6-1/2" x 8-1/2" disnlay screen 
consists of two aerial canera lenses hack-to-hack. {t has an 
equivalent f-ratins of 2.25. Lens cost in quantities of 19 or 
more would be ahout $300. 


In June, several problems remained concernin® onerational use of 
the hard-coovy terminal. One problem is the heat generated by the 
develonnent cylinder if the unit is left on continuously, ready 
to print at any time. Another is devisins a satisfactory method 
for intetrating the hard-cony terminal into the time-sharing 
system. One method for this would be to slave the hard-copy 
terminal to an ARDS when prints were to be nade, but this would 
require either providing a local operator or having the hard-copy 
terminal continuously dialed-in through one of the already-scarce 
ports. Another method would be to Provide a canability in the 
time-sharing system for saving output pictures on  dise for 
hatch-printing at scheduled tines. The sroup is studying these 
problems and hopes to have at least limite operational use of 
the hard-copy terminal in the next few months. The hard-copy 
terminal should provide a remote hard-copy capahility at a 
capital cost of about 50 per cent greater than a basic ARDS. 
Paper cost compares with that of office copiers. 


Graphics Software 


The GRAPHSYS software support system provides a convenient, 
high-level and nearly display-independent interface hetween user 
programs and the ESL Display Console or the AROS. uring the 
year, hbantel E. Thornhill and Christonher Reeve made a number of 
chanszes to improve compatibility between the ESL console 
procedures and the AROS procedures. Reeve's Master's thesis 
(Ano. A) described procedures for interactive craphics on the 
ARDS. In December, ESL and Project MAC issued a technical renort 
CMAC TR-56, Apno. 3) describing GRAPHSYS. Several student 
projects contributed to the aurmnentation of GRAPHSYS 
capabilities. Some are discussed below. To make granhics more 
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easily available to the seneral community of time-sharing users, 
the sroup formulated plans for a coherent collection of graphics 
programs which would be available to allt users. It hegan 
collecting existinz programs ani set up an on-line documentation 
system, using the TIP information retrieval subsystem of CTSS. 
Wenbers started on the following procedures for inclusion in the 
common file: 


1) A data-plotting package for plottinz data on 
rectilinear, semi-log, loz-loz, and nolar 
coordinates, and a facility to plot bar craphs; 

2) A contour-plottingz vackaze; 


3) V generalized lithtehutton vackare; 


eal disolay and simulation vackace for 
i 


) Procedures for dumping data structures on to a 
isc file and for retrieving them fron aie disc 
i 


— uN 


Other Hardware Developments 


Robert J. Ascott, in his thesis project (App. A), sodified a 
low-cost resistance-paner (Teledeltos) data tablet (which F. f. 
Blount had develope! earlier) so that it OSerates with a-c rather 
than dec excitation on the tablet. Althourh the orisinal tablet 
(see Prozress Report IV, 3. 92) worked well, with extremely 
simple electronics, it sufferei from the requirement for 
electrical contact between the stylus and the tablet surface, and 
that prevented tracing an already-drawn figure. with acc 
excitation, the stylus coupling is canacitive, and tracking can 
be accomplished through a sheet of paper. 


In another thesis project, James %. Fiasconaro is) adanting ARDS 
circuit techniques to add character and vector-ceneration 
capabilities to an existing npoint-nlottines disnolay for the PDP-9 
computer of the Speech Synthesis Sroup at the Research Laboratory 
of Electronics. This will provide a refreshed eranhic display of 
moterate verformance at a cost considerably lower than comparable 
commercially available units. 


In connection with the hith-sneed dicital link between the POP-7 
and P9P-9, Thomas L. Smith, carried out a thesis study (App. A) 
of: trade-offs between transmission snpee1 and CPU time required 
for on-the-fly error checkinz; error recovery procedures; and 
ootimal lenzths for messaze locks. The results of this study 
indicated that the present 50-kilobit sneel! is adequate, that 
the 230. 4-kilobit sneed initially specified for the PNP-7/P)P-9 


link is unnecessarily hich. 


__ 
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7. TECHNICAL INFORMATION PROGRAIA 


on the Technical Information Project (TIP) has been carried 
out in the li. 1. T, Libraries, with support mainly from the 
National Science Foundation in the early years, now with support 
From many sources within and outside MM. f. T. Currently, several 
specific tasks are funded hy Project MAC. Since this year marked 
the termination of the develonment effort related to CTSS, it 
seems appropriate to us at this time to sive some perspective on 
the advances made during the five years with CTSS, along with our 
review of current Prosress. Our report drals mainly with program 
developient, and with construction of software systems or 


subsystems, It deals also, hriefly, with education, and with 
training of students and representatives of industry, We shall 
not descrihe, here, our work in System evaluation. data 


representation, and applications, 
Protramn Yevelopnent 


The first set of Programs we developed at TIP, in id-1964, was a 
loose set of on-line retrieva) Prosrams, each tailored to a 
specific point-to-noint information-dissemination need, le 
Created separate Brosrams for retrievine information on titles, 
authors and citations in Yisliographic data nertainins to the 
Physics literature, At this early stage in our development, the 
fleeper Implications of ca on-line eperatine systen were not 
apparent to us, Instead, we dealt with CTSS as thouch it had 
merely made traditional hatch-processirea machine’ capabilities 
more available, The types of Questions and. the’ modes of 
interaction had been thought about in advance. ilew avenues were 
clesed to the user. Our mistake became apparent on CTSS, where 
users were trained to be creative and divergent in their thinking 
patterns, 


SHARE -- William p, Mathews:-- In late 1964, we imolenented 
Of tware, using the phenomenon of hihliotraphic coupling as a 
retrieval criterion, Physics articles that exhibited similarity 
in their pattern of citations to other articles could be 
retrieved efficiently by the computer, Althouth prosranners had 
tried this earlier in an off-line mode, the ability to enlarsre 
the scope of the search anit to alter the search criterion hroucht 
new retrieval possihilittes to litht. 


COUNTS and HIDEXR -- “William 90. Mathews: -- We othen, in 
"id-1965, develoned Programs tO count and index Nib ioeraphic 
citations from one Physics fournal to another, This attempt to 
make availahle an on-line citation index 4rouzht to our attention 
a mismatch between technically frasihle and operationally useful 
processes. While the fdea of a citation intex js attractive to 
the orofessicnal indexer and the librarian as a research tool, it 
is a very unattractive product ta the scientific researcher. 
Retrieval questions are seldom nosed in terns of citation 
characteristics, even thouth the ¢rowth of the scientific 
literature depends strongly on knowledze of 2revious research, 


TIP/I -- Willian 9, Mathews: -= In the Autumn of 1965, we 
developed a unified retrieval protran for the Physics literature. 
In this one packaze, it was Possible tosically to conbine 
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requests for papers pertainine to specific titles, authors and 
citations. We made this retrieval capability nublic to all CTSS 
users. Any CTSS user could interroszate a body of literature of 
aoproximately 30,9300 recent articles in physics. 


TIP/EL -- William D. Hathews:-- It soon hecame apparent that 
ye neeied amore teneralized information-retrieval capability to 
serve the scientist adequately. Unlike the ophysics literature, 
much scientific information fs not neatly packared and published. 
One cannot expect to set information from a library-maintained 


data base. Invariably, the answer must he orctanized = and 
structured into different forms for a final intellizent 
yresentation. Early in 1966, we began work on the TIP/II systen. 


This involved considering retrieval as a general text-management 
orocess in. which the user, with items of information containing 
ASCII text streams, approaches the computer and _ requests the 
partition of these items into #roups satisfying very complex 
selection criteria. Further, a substantial part of the 
interactive vocabulary, especially the naming of fields of 
information in the user's data, shouid be under his control. 
From our experience, it was also evident that control over the 


detailed format of presentation of selected items should 
similarly be under user control. Tous desitn for the TIP/II 
Ctrieval systen evolved. This system has since been used on 
fiscal, personnel and inventory information, as well as on 


hibliographic data. 


TIP/ ti Components -- William D,. Mathews:-- We had to develop 
a ranze of component packates to suoport the TIP information 
system on CTSS. ‘le produced a nowerful string nackamse to allow 
quick manipulation and operations on ASCII character strings, and 
we introduced a list packaze for maintaining lists and 
dictionaries of attributes. ‘We also develoned an efficient disk 
1/0 packase to handle ASCII files. Other important component 
packazes supplied functions for rectrsion, free-storasze 
manioulation, and data conversion, These components formed the 
huilding blocks for nearly 30 file-manipulating subsystems that 
hecame known collectively as the TIP System, 


SORT and HERGE -- William D. Mathews:-- The TIP/I1 retrieval 
subsystem allowed the user to divert information into a file for 


later processing. We jJeveloped a battery of subsystems in June 
19567 to permit orsanization, deletion, extraction and updating of 
these files on an extensive basis. We introduced other 


subsystems to tally or count the number of similar fields ina 
data collection and to produce a new file reflecting this 
summarized information. 


EDIT/I and QEDIT -- Lewis H. Morton: --Insuring reliability 
and accuracy was one of the iiost pressing problems facinz us as 
the size of our data base increased. In mid-1967, we developed 
EDIT/I as a programmable editor with a larse set of text-handling 
requests. It can be used for validation of data or 
transformation of information from one format to another where 
some syntactical analysis is required. In 1968, we introduced 
QENIT. This is a quick ASCII editor that takes its requests from 
a file and writes a revort of all editing chanzes into another 
file. The savine in machine time over the conventional CTSS file 
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editors ranges from a factor of two to a factor of five. The 
jesitn of an E00, IT/IL subsysten is now well under way. EDIT/II 
will compile code if the user wishes to run an editing procedure 
repetitively. Other features include the ability to slave’ one 
pointer to another in such a way that operations performed on one 
yvointer affect the contents of another. 


FI3CHK -- Timothy F. dempsey, Jr.: -- In the Autumn of 1908, 
we developed a subsystem which, with certain protocols in 
construction of RUNCOM's, allows for the convenient running of 
foresround-initiated backtround (FI8) jobs. Some of the problems 
that had to be approache'l were: 


1) The chainine of several FIB jobs into one controlled 


joh; 

2) Prevention of FIS job-leoping in jobs which 
establish other johs; 

3) Prevention of destructive interactions hetween 


chains of FIB jobs working in the sane directories. 


ZOT -© Lewis 4. Morton: -- Ye develoned a seneral-purnose 
environmental subsystem for use within command chains. Used in 
RUNCOl's, the ZIT subsystem can test to see, for example, if 
there is encush space left to write a file of specified length, 
or enouzh time left on a shift to complete a certain operation. 
Special action nay be taken if the snecified condition is not 
fulfillei. ZOT may also be used on-line as a desk calculator. 


TAP and ASEMBL -- Walter |. Nissen, Jr.: -- We have made two 
aivances toward a flexible and powerful text~oriented assembler 
for a modular protranmins system. The first is the development 
of the TiP Assembly Program (TAP), which is a derivative language 
of the [3M 709% FAP macro lanzuaze. TAP includes a number of 
standard functions, implemented as macro definitions, to provide, 
among other features, automatic subroutine linkate, table and 
list manipulation functions, symbolic indexing, and automatic 
recursion. Other pertinent features of the language are 
additional machine overations for pointer manipulatien and 
lozical operations, and the ability to treat standard subroutines 
and their callinz sequences as machine operations. Inttial 
experience with this assembly protram has produced an exclusively 
favorable reaction. One can write significant programs with a 
very small investment of programmer time. This capahillty is 
larcely a result of the naturalness of expression in the new 
language, 


A parallel development has eliminate! some of the major 
difficulties in the assembly of text functions. ASEINBL is a 
text-processing languase which functionally can he considered as 
the front end for a standard assembler, either FAP or TAP. 
ASE!N3L 46oaccepts as input any TIP text and produces as output, 
according to an accomnanying field table, an FAP source file 
containing ASCII coded text strings with whatever hierarchical 
text-pointer structure one desires. There is provision for the 
utilization of all standard TAP machine operations” and 
pseudo-onerations, and, szenerally, for complete progran 
structure, It is possihle to provide a sinsle source file for 
numerous distinct proctrams or for various versions of the sane 
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program, and to allow the assembly of any subset of the total 
source by use of the appropriate field table. 


SHARE -- Timothy F. Dempsey, Jr.ize In the first half of 
1969, we extended the concept of citation sharing, which had heen 
used as a retrieval criterion, to cover sharins of other 
attributes. Further generalization pernits the specification of 
a coupling strenath and the number of items ina file with which 
selected items must share. The linkaze inforination is 
incorporated into an output file for further iteration of the 
process. Now, for examole, one may select items in a personnel 
file using similarity of attributes to some sample item. 


The RUN Systen -- Walter I. Nissen, Jr.et <7 The RUN system 
has passed through several evolutionary stages in the past year. 
The program modules have heen completely recoded to increase 
their efficiency and to broaden their power considerably. The 
nuaber of programs available throush RU! has increased 
enormously. As a result of its expanded usefulness, it has been 


installed as a CTSS command. 


By reducing the coding In RUN SAYED down to a dispatcher and an 
internal table, we have made RUN SAVED small and efficient. At 
the same time, we added the capability to dispatch to prosrams in 
independent SAVED files, not just in the special RUN files. And 
jf RUW, gannot locate a desired program in a RUN SAVED file or in 

. Protram library, tt will search the public file, the 
user's directory and, finally, the CTSS command directory to find 
the desired program. Therefore the RU system enables the 
inexperienced user to have access to virtually all comnands) or 
prozrams that he misht wish to use; the familiar user can avoi‘d 
the necessity of noting each trivial change in the organization 
of SAVED files that micht previously have affected his method of 
access. We have increased the efficiency of core-storagre 
management by RUN by develoning an almorithm to identify those 
subroutines that are loaded for a= oarticular sroup of "main" 
orograms and to return to free storaze any such suhroutines when 
any other "main" program in the same SAVEN file is referenced. 
One "main" program may call another, but any “nain" program not 
called will also be returned to_ free storage just hefore 
execution begins. An unexpected offshvot of the development of 
the RUM! system has been’ the capability to organize the TIP 
library of proerams so that all programs are easily loaded and 


updated at any time. 


PAPER TAPE INPUT AND OUTPUT == David fi. Jordan: -- We have 
extended TI? subsystems to include both reading and punching of 
paper tape on the PDP-7, With the TIP subsystems for reading and 
writing magnetic-tape directortes and the CVFILE subsystem for 
code conversion, we have provided a flexible informat ion-storage 
facility. It is possible to load massive amounts of paper tape 
on to the disk, convert the information to ASCII, and store it 
for archival purposes on magnetic tape. When one desires it, he 
can retrteye this information and format it for typesetting 
machines, and can- reconvert and punch it in the special code 
confisuration needed to run such varied off-line devices as the 
Photon or the I3ft Selectric Composer. 
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FORHAT -= Walter I. ‘lissen, Jr.: -- We have developed a new 
subsystem, FORMAT, that permits hicth-speed bulk output under the 
control of a format cable like that available fron the TIP 
subsystem. This new subsystem vermits easy testing of format 
tables during the fornative stazes of an applications system. 
Also available are features for truncation or trimming of fieids 
of text before output, and tabular formats for columnar data. 


Picture Retrieval -- Walter |. Yissen Jr.: -- de have made 
alterations to the TIP retrieval sthsystem to allow for the 
storaze and retrieval of nictures for display on the ARDS scope. 
Special fornat printers control erasure of the screen and delimit 
the text to be plotted. It is also possihle to ston the output 
gtre m under format control. This is important on a_e scope 
4tsplay since the user should be able to indicate when the text 
has been read and, therefore, when this display may be erased and 
rewritten. 


CALC/1 -- Navid Me. Jordan: -- ‘landlines of numerical text 
within the Tl? system in ways other than retrieval and formatting 
had lonzt been an objective. At the hevinning of the Summer of 
1969, the CALC subsystem was nearing conpletion. Standard 
mathematical functions available in this subsystem will allow 
users to add, subtract, multinly or divide the contents of any 
field of ASCII text 4y the contents of any other fields or by 
constants. Execution of statements in a CALC specification may 
be conditional, permitting a computation to take place only if 
certain fields are present or permitting resu to he reported 
only at some logical break in the data or at the ena of a file. 


This subsystem will be useful in billing and inventory systems. 


Minor Subsystems -- “William D. Mathews: -- To round out the 
capabilities of the TIP system, we introduced a number of minor 
file-handling subsystems. ADION allows the user to add constant 
fields on to every item in ae file; RENUMB allows for the 
renumbering of fiel'iis in each item; and SEQUEN sequentially 
numbers the items in a file. One may accomolish some unusual 
results with such simple subsystems. SEQUEN, for example, may be 
used to add rank-order information to items in a particular field 


that have been sorted. ‘. & 


Freshman Seminars 


In alternate years, we have held freshman seminars on methods of 
handling scientific information. Students are siven on-line 


experience and are encouraged to do _independent study in 
organization and manipulation of information files. 


Summer Sessions 


For the past two years, the Project TIP staff has siven a 
two-week M.1.T. Summer Course in Information Technolosy. 
Professionals from industry, zovernment and other universities 
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have discussed tonics ranging from data collection and (file 
orzanization to analysis of the user  nonulation. 


TIP/III Planning Conferences 


In June 1969, we held a series of conferences on an over-view of 
the planned TIP/II1 system on itultics. Project MAC participants 
and other interested oersonnel from the M.I.T. community 
discussed and criticized of the pronposei system and = sugested 
directions for this future work, 
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8. HATHEMATICAL ASSISTANCE PROGRAM (MAP2) 


Work on WAP2 has been carried out in the School of Enzineerinz, 
mainly with support from the National Science Foundation. One of 
the most difficult aspects of implementing an on-line 


user-oriented system is to ensure that the interface is sensitive 
to the likelihood that users will make errors. In particular, we 
have found that the programming necessary to ensure that all, or 
nearly all, possible erroneous usagzes of an onerator are detected 
ant the user informed (preferably before danagze is done) is often 
more extensive than that required for the operation itself. The 


complications of such precautions can inhthit the OF Ti NAL we Wh Slee 


from himself making sietnificant and useful additions to the 
system. Perhaps for this reason, almost none of the existing 
systems makes provisions for user-added onerators. This problem, 
which we have been considering in sone detail, is the subject of 
Terence |1,. Collizan's Bachelor's thesis (Ann. A). Collisgan has 
destzned and partially implemented a eeneralized and automatic 
yrocedure for checking for errors in the specification of 
operands for onerators during the process of breaking down the 
inout statements into an executable structure. The »orocedure 
utilizes simple hit-checking of “allowed" masks amainst coded 
snecification of data-onerands. The masks allow checking of the 
operands individually, as well as of tore complicated effects 


involving lovically permissible combinations of operands. It is 
intenied that they can be specified to the system with simple 
declarations by "non-progranaiing" users. In effect, the 
error-checking routines constitute the heart of an 


"“a4d1-an-operator" operator. 


Prof. R. Kaplow 


J. W. Brackett T. Colligun 


ADMINS (BUREAU OF SOCIAL SCIENCE INFORMATION) 


I. deS. Pool S. McIntosh 


D. Griffel 
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9. ADMINS 


Ur. Stuart D. Mcintosh and Dr. David it. Griffel, who designed, 
implemented and used Mark FI) Admins to the limits of its 
capability, have been developing another system called Hark VY 
Admins for primary and secondary data-handiing. This work has 
been carrted out in the Bureau of Social Science Information, of 
the 14. 1. T. Center for International Studies, under funding 
mainly from the National Science Foundation. 


Adnins Mark FFI has been described by competitors as a CTSS 
data-reduction and cross-tabulation system, Admins then (four 
years azo) relied, and now (Mark V develonoment) relies quite 
heavily on CTSS facilities for computer Drogranning, 


There were several data-handling problems which we attacked four 
years amo: 


1) Comnuter-usable data tescrinptions for each element 
of comnputer-usable coded data; 


2) Corputer-usable onerations for linking several files 
totether according to cross-reference retations within 
data sets and between data sets; 


3) Interactive data analysis such that an analyst could 
path his way down a tree, look at the cell of a table, 
and make a decision zo recombine his data; 


4) Hither-level analysis facilitated by having 
computer-usable output that can’ be input to further 
processing as well as being used to generate reports. 


The potential users of the system were primarily interested in 
questions of quantity (not ifentity) and in numbers of items with 
certain characteristics (not with numbers of itentified items per 
se). This meant that it was out-of-place to put desisn emphasis 
on "find the identity of the item, then tell me all its 
characteristics", or "tell me the identities of all the itens 
with a certain characteristic", or "count all the items with a 
certain identity”. 


Hark | Adains was shortelived. Host of the develonnent work was 
done on Hark Ji. We dronped certain features (e.7., an automatic 
tree analyzer) and settled ona simplified version called Hark 
i)'. This has heen in use for three” years. The on-line 


documentation and the hard-copy documentation have heen rehashed 
many tines to keep pace with users! cultural chanme. The = systen 
has not changed in essence, but we have added some conventional 
statistics and some output features in order to relate it to 
other systems. 


The aspect of data-handling for which Mark {11 Admins is most 
suited is the handling, for purposes of analysis, of many 
environmental variables with many contingencies among then. fark 
{§i Admins is net particularly suited for prosress control over a 
lesser number of operating variables or over the even lesser 
number of budgetary variables. This is mainly because of the 
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lack of emphasis on up-date and on Item-identity quantification. 


The paging and sermentation in the procedural and non-procedural 
Mark Ill subsystem are ad hoc. This is to sone extent a CT5S 
constraint, but mainly it is hecause of prohlems” of 
conputer-usable data descrintion. 


The data world can be divided into several parts, amonr which 
are: data concerned with physical locations; data concerned 
with storage locations, i.e., data structures; program data, 
i.e., code and storaze tables. ‘ide have not been concerned with 
these, nor with graphics data and textual data. ODJictionary data 
(data description) and data data -- f.e., coded data -- have heen 
the foci of our attention. 


Mark IV was an ad hog attempt to provide identity and quantity 
facilities and cross-reference features not available in Iiark 
11. The Mark ¥V desimn is in essence concerned with facilitating 
fall aspects of primary and secondary cdata-handling regarding all 
questions of fdentity and quantity of ifitem and characteristic. 
In computer jarson, this means development of a programming 
lanzuazge and operating system as. an envi ronnent for an 
organization (sinulation) language that can get to the data base 


via a data-description dictionary as the lansuagre of 
representation. The user lantuare rests on a 
representation-lanzguage redescription facility and a 


report-generation facility. 


we have settled On page and = semnentation techniques -- 
especially on page tables that index identity, and on data 
structures sultahble for the irregular matrices that are our data 
descriptions. This is not intended to prejudice further 
developments in  caterorization of data structures, hut primary 
and secondary data-handling is otir current oroblem., 
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This report should be read in conjunction with last year's. 
This is particularly important to obtain a rounded view of our work 
on vision. In fact, much of what we say this year is logically prior 
to much that we said last year. Thus last year we discussed the abstract 
and theoretical problems related to the interpretation of pictures 
Presented in a clean form (as line drawings or as subsets of an abstract 
retina) while this year we Say more about how to obtain such pictures 
from the real world. The reason of course lies in the fact that 
the "higher level" work did not depend as heavily on prior solutions 


of hard-ware and systems problems. 


We have not reported new work that lies directly in the line 
of development of ideas discussed last year. In particular, we have 
deepened and generalized some of the theorems on computational geometry. 
But the new state of knowledge is indistinguishable from the old on 
the level of discussion of this kind of report. Similarly we have not 
reported very new work which is still at a too primitive level of 
of developedment to be presented intelligibly. This includes work 
on natural language processing, concept information, teaching and 
a number of mathematical topics. The time constant for project of 
this sort is longer than a year. These matters will be dealt with 


in a further report due in the fall of 1970. 
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Analysis of Visual Scenes: The Concept of Vertical 


Problem-Solving 


One can use vision in many ways to find out about objects in 
space. Let us begin with some of the simplest ideas, and then 
move on to the deeper problems. If the camera can move, one could a 
use stereoscopic correlations of two pictures. Or one might use 

local operations to measure motion parallax. These do not solve 

all problems. Stereoscopy does not work well on featureless 

surfaces or curved boundaries; motion effects are misleading on 


plain or shiny objects. Nevertheless, both methods can help find 


the three-dimensional locations of parts of visual objects. 


Focusing provides a third method of location. Its distinctive 
feature is that it needs only a single eye in a fixed location, 
provided that the eye has a wide enough optic aperture. For then 
the idea of measuring distance by stereoscopic parallax merges with 
that of measuring distance by finding the best focus setting. 
Figure 1 shows two views of an object lying on a plane; in the 


photograph it is hard to see the object because of camouflage. 


(The machine sees from a little below the viewpoint of the left 
picture.) Berthold K.P. Horn has developed a program that can 
find the lens setting for hest focus at each point in the visual 
field. The procedure, applied to a series of points along a 
horizontal scan through the middle of the cube, yields the profile 
shown in Fig. 2. (The location of the background is less definite 
than that of the cube because of the background's oblinquity to 
the camera.) Horn's program uses local Fourier transforms and 
compares the relative energy in the high and iow spatial fre- 
quencies. {t servo-controls the lens to maximize the highs, 


and it focuses at least as well as one can do manually. 
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Horn will discuss the focusing procedure in detail in his thesis 
("A Method for Finding the Shape of an Object from One View and 
Application to Face Recognition", in manuscript.) Two techniques 
are available: one uses a fast circular scan to obtain a 
periodic function characteristic of a large neighborhood; the 
other makes a faster scan of a smaller square. An cperation 
manual (Horn, "Focusing", AI Memo 160) gives instructions for 
using the system with the PDP-6 computer and its 
optical-mechanical accessories, as well as some of the theory of 


the focusing procedure. 
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Now we have described three methods for optical range-finding. 
There are many other ways to attack just this one aspect of 
visual technology -- such as flying-spot scanning from a site 
off-center from the camera, holographic methods, and even 
optica’ radar. We must not, however, allow the myriad of 
technological possibilities to divert us from the deeper -- and 
incompletely understood -- problem of developing a visual system 
flexible enough to deal with real-world problems. The goal we 
have set ourselves is to find how to make a system that can 
approach the versatility of human vision. One outstanding 
feature of that system is its "passiveness" -- the great extent 
to which it can see without much special preparation or 
interaction with the objects in the scene. The secret lies in 
the intelligent viewer's ability to combine what he sees with 


what he knows about his world. 


To pursue this, we have concentrated on the problem of 
recenstructing a three-dimensional structure using only a single 
monocular picture. People are quite good at understanding what 
is shown in a photograph; we should like to know how to make a 
machine do this. Now, with a very few exceptions, the many past 
attempts at computer analysis of scenes have been rather 


fruitless, and we should try to understand what went wrong. 


Almost all of those past attempts followed the same general plan: 


the picture is subjected to a sequence of transformations; each 
transformation is intended, in turn, to produce a successively 


more abstract representation until, finally, one obtains the 


Ay 
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desired description of the scene. Typically, such a sequence 


might be: 


1) Remove noise (by clipping, smoothing, etc.); 

2) Enhance features (by boosting gradients, etc.); 

3) Extract features (finding edges, vertices, etc.); 

4) Group features into objects (by regions, parallelisms, 
etc.); 


5) Identify objects (by partial matches, etc.). 


Although there is a great deal of plausibility to this idea of 
progressing relentlessly from local to global, the concept of 
serial stages of pre-processing does not actually work well in 
practice. It is simply not suited to the real problem. Errors 
and assumptions made at each level are passed on to the next, 
and, even if each stage is quite clever at how it handles its 
data, the accumulation of mistakes over many stages leads to 
chaotic over-all results. The basic grammar of the problem is 
too context-dependent. The appearance of an object's features 
(and even their occurrence or nor-occurrence) usually depends on 
global aspects of the arrangement and illumination of the scene. 


One must cope, for example, with 


1) Direct line-of-sight occlusion of parts of objects, 

2) Shadow occlusions that depend on the directions of 
lighting, 

3) Highlights, 
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4) Reflections, 

5) Textures, 

6) Decorations, 

7) Many other interactions between visual features and 


spatial forms. 


Accordingly, the inevitable ambiguity problems met at each level 
-- "Is this an edge or not?" or "Are these two features part of 
the same object?" -- are often not solvable at that level. One 
can select a plausible interpretation only by using a wider 
variety of knowledge about the real world -- knowledge that 
ranges from frinciples of optics: and geometry to knowledge about 


the particular environment and the objects likely to be in it. 


In the traditional processing sequence outlined above -- we shall 
call it the horizontal vision system -- different kinds of 
knowledge are implicit at each level. The principles of optics 
are involved because each visual point repr®sents a distribution 
function of space points in a way that depends on focus, 
scattering, reflection and noise. In the extraction of features, 
the processor must know whether the objects are likely to have 
straight edges, or texture boundaries, or polished surfaces. In 
analyzing a room, to give an extreme but real example, imagine 
the resulting chaos if the system did not know the significance 


of a picture frame! 


At the level of grouping features and identifying spatial bodies, 


we have already seen (Project MAC Progress Report V) how problems 

of projections and occlusion of three-dimensional objects Jead us 
away from the simple template-matching schemes that work fairly 

well for two-dimensional problems. We found, however, that many 

of these difficulties could be handled by symbolic-description 
systems, and we shall assume that the reader is familiar with Prof. 
Adolfo Guzman's work, either through the survey in Progress Report 

V, or through his Doctoral dissertation. We now conclude that the 
lower-level aspects of vision, too, are best treated as problems in 
artificial intelligence to be handled by a mixture of general methods 
and special knowledge. Because the different kinds of knowledge 
interact at different levels, we must provide channels for such 
interactions so that hypotheses about high-level things like ob- 
jects -- perhaps proposed by heuristics that use local evidence 

-- can be confirmed, rejected or revised by returning to other levels 
for other kinds of evidence. We use the term vertical system for 


this type of organization. 


We have not yet enough experience with verticality to discuss it 
abstractly. But we now know a substantial amount about its applica- 
tion to visual problems; the body of this section reports what we 


have found so far. 


Optical Anatomy of a Simple Scene 


Fig.3 


In Fig. 3, we see three cubes with dull white painted surfaces 
against adark background, illuminated by concentrated light from 
a lamp above and to the right of the camera. The data and 
methods used here are from work by Arnold K. Griffith. Fig. 4 
shows three plots of the light intensity, measured along three 


horizontal scans, each of 1000 points across the picture. 


650 


It is difficult to discern very much in these Plots. The next 
illustration, Fig. 5, shows the result of applying, to a sequence 
of such cross sections, a kind of second-derivative operation 
averaged over enough adjacent sample points to give flat plots 
over regions that have reasonably uniform gradients. (The photo- 
graph and the measurements were taken from slightly different 


positions.) 


aa 
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We have marked a number of typical features: 
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a is an outer edye. Against the dark background, it is 


Fig.5 


a simple, sudden change in intensity, giving a typical 
"bipelar" pulse 1n the smoothed second derivative. 


b is a more symmetrical feature; it is due to the 
"highlight" reflection on the front edge of the lower 


right cube. 


b' is a superposition of the effects of a b and a small a. 


c is the reflection of the bright surface of the upper 


Cube in the top of the lower right cube. 


Although the 


paint there is dull, this surface is seen at a low 


angle, and this enhances reflections. 


dis the crack between the lower cubes. 


measurements are all logarithmic, and truncated so that 


the piots won't overlap. 


The brightness 


e is the spot of dirt on the upper front corner of the 


Tower right cube. The edges and corners 


of objects 


often have highlights and often are dirty. 


f is a shadow boundary visible on the da 


f' is another shadow boundary, somewhat 
ecause of penumbra and depth of focus. 


. 


rk background. 


less sharp 
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The internal edges, such as at b, of uniformly colored objects 
often give small signals. Their width, in our plots, is an 
illusion due to the smoothing operation; on sharp corners, the 


highlight line is very narrow and “asily missed by a coarse scan. 


The history of attempts to write edge-finding and edge-following 
programs is long and inconclusive. Because the results were so 
obscure, Annette Herskovits (AI Memo 183) and Griffith made 
separate studies of the edges of geometrical objects; both 
concluded that the most common phenomena were superpositions of 
three effects (see Fig. 6): 


Fig.6 
(1) a simple step 


| 
a 


(2) a slope change 


(3) a highlight or 
a crack 
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They both investigated various detection filter methods for these. 
Griffith's thesis will include a theory of optical detection of 
edges under various assumptions about their intrinsic character and 
about the kinds of noise one might expect in a vision system. The 
most sensitive methods for detecting edges use two-dimensional 
operations, but these are very expensive with conventional hardware, 


because of the large amount of high-resolution information. 


eee 
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When a compromise must be made, it is not especially good simply 
to use a coarser homogeneous scan; for instead of the arrangement 
of Fig. 7, one can use that of Fig. 8, which has the same mean 


density but is better at catching thin edges. 


Fig.7 Fig.8 


Cr 


Co i 
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Now we apply this idea to some more realistic, cluttered scenes. 
We mark with short dashes the local maxima of the output of the 
edge-filter, along a mesh of fine horizontal and vertical? scans. 
The problem remains to convert this set of local features into 
lines, and then into objects. Figs. 9 and 10 show the results 
of a process that uses a projection operation sensitive only to 
straight-line segments. Although it is not good for close- 
packed features, it is conservative and does not propose many 


false lines. Griffith's thesis will give details. 
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RG x Fig. 10 


Another kind of process works in a complementary manner; it 
labels points whose neighborhoods are relatively homogeneous. 
Then the system finds the boundaries of the connected regions of 
such points. Thomas 0. Binford (AI Memo 182) describes 
experiments on such a system. There are many problems in 
deciding how to reduce the region boundaries to useful 
line-descriptions. We see in Fig. 12 the result of such a system 


applied to the relatively simple scene shown in Fig: is 


Fig.11 


Figs.12 
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Still another approach to line-finding uses a two-dimensional 
local-gradient detector, followed by a scheme for assigning the 
locally maximal features to edges, as in the early work of 

L.G. Roberts. Richard D. Greenblatt is developing a 


system of this sort. 


Problems exist at every stage of such processes. At each level, 
the selection of relevant features requires some a priori 
knowledge about the local world. Our verticality thesis holds 
that one cannot expect any one decision policy to work over a 
very wide range of situations, but that even a little feedback in 
this selection will help considerably. For example, each of the 
Systems mentioned above will miss some edges of some objects, 
because of the problems of resolution, illumination, focus, 
contrast, texture or noise. If the system misses an interior 
edge, the SEE program (or rather, one version of it) may have to 
propose one object in place of two, as in Fig. 13, or two objects 


in place of one. 


Fig.13 
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Assuming we are in a world of geometrical bodies, there is a 
variety of ways in which to use knowledge of the fact to propose 
corrections. (These will have to be verified, but proposing them 


is most of the battle.) 


Fig.14 


Missed edges, for example, are often related tn concavity (see 
Fig. 14), and, in a rather Bayesian way, this suggests a search 
for missed lines radiating from the concave vertices into the 


figure's interior. 


Fig.15 


In Fig. 15, we indicate proposed lines of several kinds: V 
directly to other vertices; — interior extensions of the 
vertex's edges; and L_ an (absolute) vertical edge (very common 
in real interior scenes). One might further propose parallels 


and lines that make confocal triplets. 
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It is remarkable how much can be done with such a line proposer. 
In the case of structures made entirely of rectangular solids, 
Prof. Manuel Blum showed (Fig. 16) that a remarkable number of 
interior edges can be reconstructed just from the outer profile 


of the scene. 


The numbers of lines proposed by such a scheme can be held 

to the order of tens, rather than of thousands. And the 

cost of verifying the existence of an edge with a specified 
location is enormously smaller than that of finding all such 
features independently, because -- for the same statistical 
confidence -- rejecting a particular null-hypothesis is always 
much easier than screening all of a large family of possibilities. 
The use of proposer-veri“ier system can thus reduce the total 


picture-processing effort by relaxing the tolerances on the 
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early, brute-force, feature-finding stages. In his forthcoming 
thesis, Griffith will give details of a complete system, already 
working, that does this. A first stage finds some of the edges 

in the scene. Then rew lines are proposed on bases of parallelisms, 
region completion, etc., and verified by tne detectors mentioned 


earlier. 


By using a priori information, one can often get much more out 
of a picture than might seem %o be in it. Assuming (correctly) 
that the sphere in Fig. 17(a) is uniformly colored and that the 


light comes from a compact source, 


Fig.17a Fig.17b Fig.17c 
one of Horn's programs is able to reconstruct the surface by solv- 
ing the appropriate differential equations. Then this program 
produces the stereoscopic pair of Fig. 17(b), (c). (Some readers 
will be able to fuse these by looking at a virtual point beyond 
the page.) The sharp shadow detail confirms, with the picture, the 
hypothesis of sharp illumination. Horn will present details of this 


program in his thesis. The method is quite complementary to stere- 


oscopy and focusing need inhomogenous surface detail or discontinuities. 


Ii-18 


Even if the surface is not of uniform color and texture, 
there is still much more that can be done with a monocular 
picture. Lawrence J. Krakauer is developing a system 

that analyzes scenes such as a bowl of fruit. The process 
begins by locating and analyzing illumination maxima; it 
appears that, from their intensity-shape behavior one car, 
in many cases, distinguish dull from shiny surfaces. 

Local maxima connected by an illuminated band are likely 
to be on the same object, ard Krakauer is testing some 
other heuristics for associating highlights with edges 


(Fig. 18). 


Digitized Image of Edges and Maxima Features 
Some Fruit Enhanced Alone 


Krakauer's goal is to discover visual characteristics of 
surface properties, and to establish heuristics for group- 
ing "non-geometric" features into natural objects, perhaps 


as Guzman did for geometrical objects. 
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The Vision System 


We have discussed Guzi.an's SEE program (Progress Report V); 
this program assumes a scene description in terms of edges, 
vertices and regions, and produces a proposed assignment 

of these features to a set of three-dimensional bodies. 
Guzman's thesis describes in detail a more advanced version 
of “nat project. It includes additional heuristics for 
linking parts of objects, analysis of the system's be- 
havior, discussion of and heuristics for correction of 
mistakes because of preo-processor errors, and some 
analysis of the system errors due to inherent ambiguities 
in ordinary scenes and in a variety of standard “optical 
illusion" scenes. Guzman's thesis also includes some 
observations about the problem of matching features be- 
tween stereo pairs. In particular, the following simple 
technique, when it is embedded in a vertical system, will 
solve the majority of such problems. Consider two views 

of a geometrical scene (Fig. 19). In any stereo pair, 

One can dissect the two pictures into sets of matching 
line-pairs, defined by the planes through the two eye- 
points. Any physical object visible to both eyes will 

be sandwiched between the same highest and lowest such 
lines, as suggested by Fig. 20. For two different objects, 


it is unlikely this will be true by coincidence, especially 


eecrwee ee 
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if the objects have more than one visible face, since the 
sandwich proposition is true also for each face! Thus , 
once enough point features are identified in the monocular 
pictures, one will have little difficulty in matching 

them between the pictures, and expensive cross-correlations 
should be unnecessary. This matching works well even when 


the two viewpoints are far apart. 


Fig.20 
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In another attack on stereoscopic vision, David N. Perkins, 
Jr. is developing a system that compares two views and pro- 
duces depth information. His matching procedure is compli- 
cated by the requirement that it be tolerant of the many 
kinds of errors that pre-processors are likely to make. 

It proceeds by matching vertices and arcs topologically, 
with some geometric constraints, and with a back-up and 
search procedure for finding maximal matches of substructures 
when there are possible local ambiguities. Then, given 

the best topological match, the program proceeds to analyze 
the arcs that are thus proposed for matching to obtain 
space curves. For example, the stereo pair of Fig. 21 are 
converted, by Binford's TOPOLOGIST system, to the views 


of Figs. 22 a and b. 


Fig.21 


Then Perkin's system, on request, produces views as seen 
from the right side and from above in Figs. 22 c and 22 d 
(shown here without suppression of lines that might be 


hidden by surfaces). : 
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Fig.22a Fig.22b 
Fig.22c 
Fig.22d 
} 
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We now have a complete horizontal system of programs con- 
necting Binford's topological pre-processor with Guzman's 
SEE program and on through tw Patrick Winston's new system 
(described later in the report) that recognizes some 
particular types of objects -- e.g., wedges, pyramids, 

and rectangular blocks -- and learns to identify some 


multi-object structure such as rows, towers and bridges. 


Professor Hosakere N. V. Mahabala developed the TOP- 
OLOGIST-to-SEE interface. This program, SETUP ("Pre- 
processor for Programs Which Recognize Scenes", A.I. 

Memo 177) gobbles a list of line segments and produces 

a complete topological description of the graph formed 
by the lines. Because such features as lines and ver- 
tices found by pre-processors have some uncertainty in 
location, there are usually serious problems in decid- 
ing when two edges are really the same, or in connecting 
edges and localizing vertices. SETUP centains heuristics 
for plausible guesses about such matters. All lines are 
treated as enclosed within strips of a certain width, 

and all problems about closure, intersection, containment, 
colinearity, etc., are resolved by procedures that are 
based on a single predicate about the orientation of a 
point with respect to one of these half-line strips. Al- 


though this may not be any better in performance than other 
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segment-joining and line-grouping criteria, it is probably 
no worse, and Mahabala's system is distinctive in having 
greater logical clarity than any other system we have 

seen before. It is therefore muci more likely to be im- 


proved by advances in theory! 


Finding the edges themselves is still a problem. When 

one knows, a priori, that they are straight, the criteria 
Griffith and Herskovits developed are probably adequate 
for practical purposes, aiid these are further subject to 
substantial heuristic speed-up innovations. For the more 
general problem of describing an ordered set of points 
(such as one obtains as the boundary of a "homogeneous 
region") as a curve, we need a systematic way to apply 
various kinds of a priori knowledge. At present, we 

have a variety of such attempts, such as POLYSEG (Griffith, 
A.I. Memo 131), the Greenblatt-Holloway line-finder (A.1. 
Memo 101), and thiee others, by Binford, Greenblatt, and 
Jayant M. Shah. Unfortunately, none of these is wel] 
enough understood to be considered theoretically firm. 

A variety of other general-purpose curve-segmentation 
procedures has been described in the literature. But 

no one has really come to grips with the basic problem 

of incorporating the relevant a priori information, and 


we conclude that this is one of the aspects to be faced 
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in designing the vertical vision system. 


R. Orban has developed a new program, ERASER, which detects 
and removes shadow boundaries from gemoetrical scenes. 
ERASER resembles SEE in that it uses the same classifica- 
tion of vertex features, but it also uses information about 
the relative brightness of regions. Its heuristics are 
based largely on the abundance of L, T and X types of 
vertices on the boundaries of shadow regions. ERASER is 
designed to criticize the output of SETUP, to remove shadow 
boundaries, and then to re-submit the result to SETUP 
before passing the problem on to SEE. It will not 

remove all shadows, but it is conservative about not re- 


moving real edges. Examples of ERASER performing well 


om 


are given in Fig. 23. 


ll 
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The ERASER system also contains some heuristics that Binford 
developed for guessing the direction Of illumination; this is 
used to bias the operation of ERASER and is available to the 
rest of the system. In geometrical scenes, the system can 
measure the angles between real vertical edges and those 
shadow boundaries that appear to lie on horizontal surfaces 
in order to get a quantitative measure of illumination 
angle. As an application of verticality, the operation of 
both ERASER and SEE could be enhanced by verifying, at a 
higher level, that some of the shadow boundaries lie across 
otherwise uniform surfaces (say, by using Perkins's stereo 
system) or by verifying, at a lower level, that the proposed 
inner shadow boundaries are less than sharp, i.e., have 


penumbras. 
Mechanical Structure Analysis of Visual Scenes 


Blum and Griffith have written a Program that can analyze the 
stability of the three-dimensional Structure of rectangular 
blocks. The program uses this analysis in a planning scheme 
to propose the order in which the Structure is to be built. 
Even in manipulating toy blocks, there are problems; one can 
ask which of these structures can be constructed with one 


hand. 
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Referring to Fig. 24, the one on the left cannot be built, 
the program asserts, because there is no stable three-block 


sub-part of the structure. 


Fig.24 


But, with another interpretation of the rules, one could 
first assemble the upper three blocks on the floor, then 
lift them into position. We expect our more advanced 

construction-planning programs to be able to use more ad- 


vanced strategies in which sub-assemblies are so identified. 


Winston is completing a program that learns to recognize types 
of structures from sequences of examples. We consider it 

to be a major advance in the areas generally known as concept- 
formation, or machine-learning. Given a scene (represented 
by a collection of regions, as produced at the output of the 
SEE program), Winston's program attempts to describe the scene 
in terms of elementary objects and already-known sub-structures 
and relaticns, using a descriptive language reminiscent of that 
Dr, Thomas G. Evans used (Ref. 2), but Winston's language 


is further developed. 


a 
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For example, the scene of Fig. 25 leads to a description like 
that shown to its right. We tell the program this is a 


picture of an ARCH. 


mt . : 
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Next, we inform it that the picture of Fig. 26 is not-an- 


ARCH. The program then proceeds to compile a new description 


(of ARCH), as shown to the figure's right. 


This new description of ARCH is obtained by comparing the 
descriptions of the two scenes, thus providing a second-level 
description. The must-not-abut relation is the outstanding 
difference it discovers, and it modifies the description of 
ARCH to require that the abutting relation not hold between 
the two supporting blocks. (Winston's program is equipped 
ab initio with heuristics for proposing support and non-sup- 


porting contact.) 


sao 
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Now, when we give the program the next figure (Fig. 27) as 
another example of not-an-ARCH, it again modifies the ARCH 
description, this time requiring (rather than just mention- 


ing) the supported-by relations. 


Again, this change occurs because the change in support 
is the most prominent difference the comparison program 
found. Finally, we show it one more example of an ARCH, 
and it re-compiles a description in which the requirement 
that the top-object be a rectangular brick has been re- 


moved (Fig. 28). 
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There are many arbitrary elements in how this program decides 
what to do at each step -- which differences to give highest 
priorities, how to match up different description networks, 
what explanations or excuses should be assigned to the dif- 
ferences it notices. These commitments are kept on back-up 
trees, so that it is possible for the program to recover from 
at least some disasters. The goal is to obtain behavior that 
would be plausible in, if not typical of, a child. The par- 
ticular concept that the program develops from a certain 
sequence of examples will depend very much on those examples 
and on the order in which they are presented -- as well as 

on the connection of concepts the program has already ac- 
quired at that time. The experimenter will not always get 
the results he wants or expects! We cannot expect the first 
real concept-learning programs to be foolproof, any more than 
a teacher can expect his favorite instructional technique 
always to work; but here at last we have a chance to under- 
stand precisely how a training sequence interacts with 
built-in and previously acyuired ingredients of the system. 
Winston's methods are related to earlier ideas like those in 
Newell and emphasize the use of explicit descriptions. While 
there is some similarity, in Strategy, to the Feigenbaum- 
Simon EPAM scheme, the result is pointed toward a true struc- 
tural network description of the concept. It should there- 


fore lend itself better to direct anaiyses of examples, 
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instead of having to work through synthesis by generating 


and testing proposed examples. 


More generally, having a description (rather than just a test) 


for a concept seems absolutely crucial. The advantages include: 


1) The possibility of making deductions about the 
concept, including its consistency with a proposed 
detection scheme 

2) Combining several descriptions in non-trivial ways 

3) Comparing and contrasting descriptions, as in 
Evans's program 


4) Using the description to generate, rather than 


merely to select, what next to do in a search program. 


Similarly, in search processes, one could combine several 
descriptions to obtain summaries of the information acquired 
in exploring different alternatives. In most earlier heuristic 


search schemes, the procedures usually have to abandon almost 


all information acquired in the course of unsuccessful explora- 


tory attempts because of inadequate descriptive facilities. 


The A.I. Group is now committed to a braod attack on the problems 


of symbolic learning, through the application of heterological 
kinds of knowledge to the analysis of descriptions. An essay 
(A.I. Memo 185) gives a preliminary statement of our plans for 


this project. 
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Theorem-Proving 


Gerald J. Sussman has implemented a theorem-proving program that 
uses the Resolution prirciple with an assortment of retrieval 
methods and other heuristics to make deductions in predicate 
calculus. Although there has been a number of interesting 
results, we nevertheless believe that the use of this technique, 
as an approach to artificial intelligence, is receiving much 
undue attention today, and we do not plan to give it a large 
place in our future activity unless some new and impressive 
demonstration of its power comes to light. Sussman has been 
experimenting with a variety of means for combining the deductive 
strength of predicate-calculus resolution with heuristic flexi- 
bility of less formally constrained problem-solving schema, 
but his conclusions are not encouraging. An example that puts 
one of the problems into a nutshell is this: suppose one is 
given 

A => B and 

c =>o, and 

(A ==) & (C S50) SS, 
and one wants to deduce the simple conclusion 


Es 


To be sure, the program manages eventually to obtain E, but only 
after many steps of converting the given statements into expanded 


"normal" forms that are in themselves rather meaningless. 
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This would not be so bad in itself, but it would become an acute 
problem if one were to try to give the theorem-prover additional 
advice about what to do in other situations, since the kinds of 
Situations for which we can describe such advice are hard to 
represent in terms of the fractured internal expressions the 
Resolution system creates for its own use. In another experiment 
Sussman modified the representation of this problem so that the 
=> symbols were not recognized by the prover as meaning implies, 
and he added a separate set of axioms for using a more natural 
deduction method. Now the Resolution system produced a better 
proof in less time, even though it had to work indirectly 
through the new axioms! No doubt this particular problem 

could be ameliorated by some variant of the many combinatorial 
schemes that are being widely studied today, but we feel that, 
Once such systems attempt to solve "real" problems, all such 
devices will fail as mere Stop-gaps; they do not help one to 
come to grips with the construction of the kinds of cognitive 
models we think are needed to solve hard problems by using 
accumulated knowledge and experience. Sussman's system takes 
some steps in this direction by maintaining its statements in 

a structure partially ordered by the substitution-instance 
relation. Nor is his system restricted to first-order pred- 
icate calculus. But our gloomy expectations remain. In- 
cidentally, we do not feel that completeness, or even con- 


sistency, is of very large importance. Logical completeness 
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is more or less inevitable in any system that knows a great 
deal, and consistency is not a notable feature in the intelligent 
machines that already exist. (The backers of the Resolution 
method achieve the wrong kind of completeness; the kind of 

completeness we need is for the problem-solver to be able to 


use any "natural" technique. ) 


At perhaps another extreme, Carl E. Hewitt is developing a 
language for constructing deductive systems with the utmost 
heuristic flexibility. His language, PLANNER, is designed to 
allow use of knowledge and ty cs of representations of great 
variability. PLANNER must be one of the least procedural 
languages: to a large extent, one can specify what one wants 
done rather than how to do it. Consider, for example, a 
Statement of the form "A implies B". As it stands, it is a 
simple declarative statement. But in PLANNER it can instead 

be interpreted as the imperative: "set up a Procedure that will 
see if A is ever asserted, and if this happens assert B also." 
Or, it can be interpreted: "set up a procedure that will see 
if B is ever desired as a goal, and if so assert A as a new 
sub-goal to be deduced." This is only the skeleton of the 
idea: PLANNER contains machinery for easy manipulation of 

many different roles of declaratives, imperatives, goals and 
deductions. In attempting a particular deduction, for example, 
Programs can specify suggestions for other theorems that should 


be used (and even in what order) to make the deduction. All 
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Statements are expressed in a powerful new pattern-matching 
language, MATCHLESS, in which control of procedures is 
Specified in terms of the forms of assertions, rather than 


in terms of particular assertions. 


Because problem-solving often requires building up elaborate 
temporary structures, PLANNER has machinery for handling 
Statements that were once true in a model which may no longer 
be true after ections have been performed, and for drawing 
conclusions that may have to be deduced from such a change. 
Control of such matters is specified in terms of local states, 
to which are bound information about changes in the data 

base -- erasures, assertions, new definitions, etc. -- since 


the data were last updated. 


Hewitt describes his system qualitatively in a conference paper 
and in detail in A.I. Memo 168. He has now implemented the 
language and is programming a compiler to obtain sufficient 

Speed to permit full-scale experiments. At present, he is using 
it to study deductions about the manipulations of objects by 

a robot, as in "pick up all pairs of cubes that are the same 
color, with one cube on top of a third cube that is in front of 
the other." Terry A. Winograd is completing a system that will 
translate such natural English statements into PLANNER assertions 


and theorems. 


«> 
“> 
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Natural Language Systems 


Winograd is completing a system for handling problems of 
computer understanding of natural language. The system is 
designed to accept information in normal English sentences, 
to answer questions, and to execute commands, using semantic 
information context to understand pronoun references and to 
disambiguate gramatically complicated texts. To do this, 
Winograd uses a new linguistic scheme, based partly on 

the systemic grammar described in Halliday (1967) and in 
Winograd (1968) and partly on a special representation 

for both the grammar and the semantics. In previous attacks 
on such problems, some workers have used heuristic tricks 

-- key words or matching of phrase fragments -- or they have 
used non-heuristic formal grammars to do a detailed analysis 
of the sentence to which the semantics are to be applied. 
Winograd's system is based on a heuristic grammar that uses 
contextual information in analyzing the sentence, carrying 
out the semantic analysis concurrently; this is made possible 
by representing the grammar as a program instead of as a 


set of static rules. 
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Definitions of words, as well as the system's knowledge about 
things, are also stored as programs and are available to a 
deductive part of the system. This allows much more flexibility 
than one gets naturally from networks or rule-lists. The program 
can make long, complex deductioits in answering questions or in 


absorbing new information. 


Tne grammatical part of the system is operating, with a quite 
comprehensive English grammar (for a description, see Winograd, 
“PROGRAMMAR, A Language for Writing Grammars", A.I. Memo 181). 

The semantic programs are still in preparation, to be combined 
with the deductive system, which will use Hewitt's PLANNER 
language. The entire system could be used for a general question- 
answering facility for any corpus of knowledge programmed into 
the deductive system. The first applications will probably be 
concerned with instructing a robot and with analysis of children's 


stories at the first-grade levels. 


Loosely Stated Mathematical Problems 


Several years ago, Daniel G. Bobrow completed a program that was 
able to solve some algebra problems stated in ordinary English. 
The mathematical material was rather sharply restricted to 
converting the sentences into simultaneous linear equations. 
Recently, Eugene Charniak has completed a new program that can 


solve some problems like this, stated in imprecise English: 
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A train, starting at 11:00 a.m., travels east at 45 

miles per hour while another, starting at noon from the 

same point, travels south at 60 miles per hour. How 

fast are they separating at 3:00 p.m.? 
Solution of this problem requires a number of intellectual! 
skills. Among these are, of course, the ability to manipulate 
the English text to derive a well-formulated symbolic provlem, 
and knowledge of elementary calculus necessary to solve the 
symbolic problem. Perhaps less obvious, but equally important, 
is access to miscellaneous knowledge about the real world: for 
example, the knowledge that east and south are orthogonal 
directions, and enotgh knowledge about time to deduce that 3:00 
p.m. is four hours later than 11:60 a.m. 


Fig.29 ae 
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VALUE: 15 HOURS 


Charniak's system, CARPS (CAiculus RAte Problem Solver) (TR-51, 


App. B), translates this problem into an internal representation 
of which a general impression can be gleaned from Fig. 29. This 
structure indicates two objects, TRAIN and ANOTHER. Associated 
with each is a velocity given as direction and magni tide, a TIME 
that has a starting value for each train and a WHEN-CONDITION. 
The latter refers to the fact that most calculus "rate problems" 
ask questions roughly of the form, "What is A when B?"; in our 
Case, we obtain, "What is the speed at which the trains are 


Separating when the time is 3:00 p.m.?" 
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CARPS uses this information structure to formulate an algebraic 
manipulation problem. HOW FAST and SEPARATING indicate that what 
is desired is the rate of change of the distance between the two 
objects. The distance is the hypotenuse of a right triangle with 
legs toward AST and SOUTH. The lengths of the legs are obtained 
by multiplying the rate of travel by the time. The expression 


for the derivative is therefore 


d 
ae OV (45[t-119)2 + (60 [t-12])2 


The desired value is at 3:00 p.m. (our WHEN-CONDITION). The 
derivative is found by algebraic manipulation, and the program 
then types 

THE ANSWER IS 72.246 MILES/HOUR. 
For its algebraic operations, CARPS uses parts of the MATHLAB 
programs described elsewhere in this report. CARPS has been used 
to solve other problems taken verbatim from calculus textbooks. 
These problems deal with cones, spheres and shadows as well as 
distances. In each case, the program was given sufficient 
knowledge to parse the sentences, set up an internal structure, 


and generate the equations. 


We do not wish to imply that the program is very strong at 
solving calculus problems. Frequently, a problem cannot be 
solved because the program is unable to handle the syntax used, 
no method of solution is known to the program, or the problem 
requires facts about the real world which the program does not 


know or could not handle. 


a 
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An example of the last dif’iculty arises in: 

A ladder 20 feet lone leans against a house. Find the 

rate at which the tov of the ladder is moving downward 

if its foot is 12 feet from the house and moving away 

at the rate of 2 feet per second. 
Most adults have little difficulty in visualizing the situation 
described in this problem. CARPS is stuck because it does not 
know in which direction the foot of the ladder is moving. The 
phrase MOVING AWAY is interpreted by neople to mean "moving away 


along the horizontal ground on which the foot of the ladder is 


presumed to rest." CARPS, however, does not realize that. 


Clearly, CARPS's lack of such real-world knowledge cannot be 
circunivented or ignored. A crucial rart of research towards 
problem-solving ability of this sart is concerned with the 


representation and use of such knowledge. 


orl 


We have discussed (AI Memo 185) preliminary ¢tudies attempting to 
analyze the sorts of knowledge used by a first-grade child in 
understanding children's stories. Much of th2 calculus student's 
"general knowledge” is already structured at the 
elementary-school level, and we feel that progress in this area 
is essential to the future development of anything like "general 
intelligence". We have decided to make th’s area a very large 


part of our work in the next few years. 
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MATHLAB 


MATHLAB is an interactive computer-program system that is 

being developed to facilitate creative work that involyas ex- 
tensive manipulation of symbolic algebraic expressions. MATH- 

LAB is intended to be of help to research inathematicians and 

to appliers of mathematics engaged in "heavy manipulative work." 
In the MATHLAB project, therefore, serious attention has been 
paid to human engineering and to efficiency and speed of opera- 
tion as well as to basic niathematical problems and to the problem 
of programming the computer to exercise initiative and some 
judgment in selecting and carrying out transformations of al- 


gebraic expressions. 


MATHLAB is our best illustration of the idea of building 
"knowledge" into computer programs. The MATHLAB programs are 
actually quite capable in solving certain nontrivial mathe- 
matical problems. The essential idea, however, is to include 
provisions for interaction with that knowledge so that the user 
can build and administer complex but well-understood procedures. 
At best, the user and the programs supplement and reinforce 

one another and march rapidly through symbolic-manipulation 
problems that would take the unaided mathematician countless 


hours. 


This last year, a new and much advanced MATHLAB was planned 

and partly implemented. Progress was made on faster parsing 
algorithms and on a representation of polynomials that speeds 

up addition and multiplication. At the same time, a significant 


advance in programmed symbolic integration was achieved by 
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implementing a decision procedure, due to Risch, for expressions 


involving rational functions, logarithms, and exponentials. 


Also this last year, an extension of MATHLAB to handle special 
functions defined as integrals has enabled Moses to verify or 
correct some published tables of integrals. In 1958, 

W.D. Maurer of the Argonne National Laboratory compiled a table 
of 150 integrals involving the error function, erf(s). Maurer 
was unable to verify, by hand, the following integral, which he 
included in the compilation with a note to that effect. In 


fact, it was slightly incorrect as printed: 


xZerf(ax + b)eP’Xdx = tert (ax + b)ePX(2 - 2px + px? ). 


p 


+ Pax_- pb + p2/2a - 2a a7 (axtb)?+px 


4 3 a Se 3 
- Pla = 4p'b + dab" p” — Zap" + gatbp + 82° Q-p/ale = blenetax +b - 2) 
4a3p : 


Moses' program was able to find the error. 


Here is an example of factorization of a polynomial: 
(1) x**6- 1=0 (typed in) 
(2) x6. 1 = 0 (MATHLAB's response) 


In line 1 the user typed an equation. (The syntax is awkward 
because the typewriter is a one-dimensional device.) Line 2 
shows the computer's response displayed in the conventional 


two-dimensional syntax of mathematics. 


SA SER -<— - 
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(3) 'PF('WS) (input) 
(4) (x=1) (x41) (x2+x41) (x2-x41) (response) 


In line 3 the user asked the system to factor the equation in 
line 2 by requesting the operation PF (Polynomial Factorization) 
to be applied to the WS (Work Space). The Work Space is always 
the last expression known to the system. In the response given 
in line 4, the quadratics are not fully factored because the 
program is restricted to finding the smallest factors which have 


integer coefficients. 


Next we shall ask the system to integrate an expression, 
differentiate the result, and then check to see whether or not 


the result is identical with the original expression. 


(5) 1/(x**3 + A*x**2 + x) (input) 
] 
(6 = ae (response 
x3 + Ax? + x ‘ 
(7) '‘integrate('WS, x) (input) 


(8) IS THE EXPRESSION 
ae 
to be considered positive, negative or zero (response) 


(9) NEGATIVE (input) 


The computer wants to avoid terms in the integral which have 
complex values, if possible. This is the reason for the 
question posed in line 8. Line 9 is the user's response, 

and line 10 is the integral. Obtaining the result involves 

use of a subsystem for handling partial fraction decompositions 


and one for integration of rational functions: 
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(10) - 1/2 log(x® + Ax + 1) 


+ —_ arctan 2X + A _ + log(x) 
sqrt(-A°+4) sqrt(-A°+4) 


We shall now differentiate the integral. 


(11) "DERIV('wS, x) (input) 
Te) - 2A ~ 1/2(2xtA) , 1 
a ae, p IA 
({2x+A) + 1)(-A2+4) axAH . 
2 
-AP+4 


Well, the result is certainly not identical to our original 
expression (line 6). This is because the differentiation 
program differentiates a sum term-by-term without combining 
the results. (Note that the log x term in line 10 gave rise 

to the term in line 12.) To our rescue comes a simplification 
Program RATSIMP (RATional SIMPlification) which will expand 
denominators and combine the entire result into a single 
fraction. This fraction is simplified by removing the greatest 


common divisor of the numerator and denominator. 


(13) 'RATSIMP('WS,x) (input) 


(14) 1 (response) 
x he” + & 


Here is another integration problem: 


(15) **3/ C1 x**2 )**(3/2) (input) 
(16) x3 (response) 
(Rx2) 9/4 
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(17) ‘integrate('WS, x) (input) 


(18) ] 
sqrt (-x¢ +1) 


2 


sqrt(-x" + 1) + (response) 


2 + ] over the more conven- 


Note the system's preference for -x 
tional 1 - x2, Mathematicians tend to obey the rule: Ifa 

sum of two terms contains a positive term and a negative term, 
write the positive term first. Graphic heuristics of this sort 


are now included in Martin's more sophisticated display routines. 


(19) (28X*H6 +5 HAD AASEA RN HAZE] ) / (X#AZ+]) AAD HRY HHL 


(input) 
(20) 2x0 + 5xt ; x3 + axe +t 1 x2 (response) 
(x + 1) 
(21) ‘integrate('WS, x) (input) 
(22) 2x + 2x3 +1 axe (response) 


2x2 + 2 


These results depend on a powerful pattern-matching program 


for algebraic expressions. 


This example points clearly the need for a better way to enter 
mathematical expressions into the system. Methods which allow 
users to hand-write algebraic expressions, using a pen-like 


device, are now close to the step of useful application. 


The following example shows the system solving a linear differ- 
ential equation with constant coefficients, a problem-type of 
great interest to electrical engineers. 


(23) DERIV(y,x,3) + A*DERIV(y,x) = sin(2*x) 
(input) 
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3 
(24) oye apy = sin(2x) (response) 
Dx . 
(25) ‘'LDESOLVE('WS, y, x) (input) 
(26) NEED INITIAL CONDITIONS (response) 


The program allows one either to enter specific initial values 
for Y(0), Y'(0), and Y"(0) or to leave them as indeterminates 


(as is done below). 


(27) ALLFORMAL (input) 
(28) IS THE EXPRESSION 
A 
TO BE CONSIDERED POSITIVE, NEGATIVE OR ZERO 
(response) 
As before, the answer to this question will be used to generate 


an answer which contains no complex terms. 


(29) POSITIVE (input) 
(30) 2Y(0) + 2Y"(0) +1 
2A 


4 2(O)A + 4¥"(0) + 2 


5 cos(sqrt(A)x) 
A’ - 4A 


+ Y' (0) sintsar Hal) + oe cos (2x) 
(response) 


The solution is obtained with the help of a subsystem that takes 


Laplace transforms of both sides and obtains the inverse 


Laplace transform of their ratio, using the package for integrating 


—<p 


eee 
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rational functions. 


Hierarchical organizations such as MATHLAB's will become increas - 
ingly popular as programs and systems get more complex and 
sophisticated. While this system is ostensibly working at the 
higher levels of the hierarchy, it is actually Spending most of 
its time at the lower supporting levels. Probably, over half 

the routines of its more than 60,000 words of memory are being 
utilized in solving a differential equation. This should serve 

as a warning to designers of time-sharing systems who would prefer 


that programs limit their use of memory. 


The field of computer-aided instruction (CAI) has not progressed 
to the extent that many have wished. Teaching programs are 
unable to answer questions other than those which the designer 
had foreseen -- not only because these programs do not know 
enough English to understand the question, but basically because 
they do not at all understand the field about which they are 
Supposed to teach. The designer of the usual kind of CAI course 
writes a script which is controlled by a context-independent 
interpreter. The script designer must provide for many possi- 
bilities at each step because he cannot rely on the interpreter 
of the script to know anything about the field with which the 
script deals. If a student is allowed, as he rarely is, a rea- 
sonable flexibility in replying to the Program, then either the 


script designer is physically exhausted from considering all the 
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Plausible replies, or he misses considering many such replies. 


We contend that a primary step in writing a sophisticated teach- 


ing program should be the education of the program in the area 


about which it must teach. 
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Consider, for example, the problem of writing a program for 


the teaching of freshman calculus techniques for differentia- 


tion and integration. Using the knowledge of these techniques 


that is imbedded in the MATHLAB system, it would be possible 


to write teaching programs that check the steps of a student's 


calculation, advising him of errors as he progressed. A pre- 
liminary program with these facilities has been prepared. 
Such a program could also be capable of performing differ- 
entiation or integration problems suggested by the students, 
explaining its steps as it proceeded. To achieve such 
capabilities through the writing of mindless scripts is 
unthinkable. To be sure, the experiments with the Calculus 
Rate Problem Solver (CARPS) have shown that a great deal more 
must be known before such programs can deal effectively with 


human beings in a wide area of knowledge. 


The task of finding out what huinan users know is clearly 
related to the tasks of workers in the field of linguistics, 
psychology, and philosophy. A major failing, however, in 
the education given in those traditional disciplines is the 
lack of understanding of the concept of an algorithm or 
process. We belizve that advances in our understanding of 
human knowledge as algorithmic processes are likely to be of 


supreme importance. 
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During the latter part of the proposal period, MATHLAB will 
interact strongly with the planned work in computer graphics. 
Although the most essential part of MATHLAB is conceptual and 
the next most essential is the programming of symbolic trans- 
formations, the interface with the user is very important, 
and we look forward to giving MATHLAB every advantage of 


effective man-computer- interaction techniques. 
Chess 


MACHAC-VI, a chess program developed by Richard D. Greenblatt 
for the PDP-6 computer, was begun in November 1966. It en- 
tered its first tournament the following January. Since then, 
it has participated in four more official U.S. Chess Federa- 
tion tournaments and has achieved an official rating of 1528. 
In its last tournament, it had a performance rating of 1720 
and drew an 1880 player. These statistics put it a little 
more than one standard deviation below the mean strength of 
all U.S. tournament piayers (which is about 1880). The 
machine has played perhaps 2000 games against chess players 
of every strength. It wins about 86 per cent of its games 
against tournament players. Greenblatt estimates that it 


represents a programming effort of about six man-months. 


Once the basic program was completed, the first step in 
refining its play was to give it a model of the flow of 


power on the chess board. Next, it was given explicit 


_ beeen et 
lettre aE penta tlle 
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techniques for pins and discoveries; then it was given positional 


and pawn structure considerations. These and other features 


are largely completed now. 


The next major step, only partially implemented now, will be 
to incorporate symbolic reasoning ability into the program. 
The opponent's threats are speci“ically identified, and each 
repiy is evaluated for its ability tp answer some or al] of 
the threats. This does not mean that the opponent's threats 
were ignored before; but now the program proceeds with a more 
explicit sense of what it is doing, whereas in the classical 
minimax strategy the threats are implicit and therefore cannot 
be subject to symbolic statements of what to do. The explicit 


system should be much more efficient. 


Another new feature, already implemented, adapts the 
plausible-move system to recognized states of the game: 

if one side appears to be ahead in the look-ahead analysis, 

it will prefer holding, trading and simplifying moves; the 
other side will require positive, attacking moves that might 
yield tangible gain. These and other features are operational, 
but the other programs have not been modified tc take as much 


advantage of them as they probably could. 


Another class of improvements is planned which will include 
real symbolic learning rather than mere tuning-up. We expect 
that these, with improvements in end-game strategy, will result 


in a substantial increase in strength of play. 
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Eye-Tracking 


The much-used term "man-machine interaction" usually refers to 
situations in which the communication relationship is exceedingly 
lopsided. Man can see, hear and touch the computer in many ways 
and places, but the machine is restricted to receiving informa- 
tion through a narrow bottleneck -- usually a Teletype. We have 
been interested for a long time in redressing this imbalance. 
This year we were able to achieve an old goal of enabling the 
machine to look at a person. More precisely: the machine looks 


at the man's eyes to determine his point of fixation. 


Recording eye movements is a well-established technique in the 
study of perception, tracking skills, and even problem-solving 
behavior. But traditional methods give the pattern of eye 
movement only after analysis. We believe our system, which 
incorporates an eye-tracking device developed by J. Merchant: 
of Honeywell under NASA contract, is the first that enables 

a computer to use the information in real time. As an example 
of its uses, Samuel L. Geffner has two programs that display 
text for a subject (such as a small child) to read and take 
action related to the word currently fixated. One of the pro- 
grams causes the word to be pronounced by the computer; the 
other causes the word to be replace, in the display, by its 
translation into another language. These programs are mere 
demonstrations -- but they illustrate rich applications to 
teaching and other interactive situations. We are pursuing 


such applications, partly with support from NASA. 
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The A.I. Time-Sharing System 


The experimental work of the Artificial Intelligence Group re- 
quies a high level of computer service for a limited number of 
users. Our system is based on time-sharing’ a two-processor 
machine (PDP-6 and PDP-10) with a core memory of 218 words of 
36 bits. Normally, the programs of the active users remain in 
fast memory; this limits the number of users now, but a paging 
device to be completed early in 1970 should allow some expansion 


required by the maturation of certain projects, notably MATHLAB. 


The special requirements of the project led to a time-sharing 
system with enough novel features to merit discussion. Donald 
E. Eastlake describes the system in detail ("ITS 1.5 Reference 
Manual", A.I. Memo 161a). Besides the usual kinds of input- 
output and system calls, there are a number of special calls to 
reduce overhead and to facilitate real-time control. These 
include programs for operating the mechanical hands and com- 
puter eyes and other special remote-control devices. Al] 
ordinary time-shared programs run in one of the processors, 
and critical real-time processes are assigned, by user calls, 
to the other. The real-time processor is normally assigned 


to a single user at a time. 


Because all user programs ordinarily reside in core, it is 
possible to switch between programs with great rapidity. Quanta 


of user time are short enough to allow program response to single 


et eR rn ee ee ee 
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typed characters without. noticeable delays. When swapping is 


introduced, we expect it to affect only semi-dormant users. 


A user may have many jobs running "simultaneously". Each user 
commands a tree of procedures; each can create and control sub- 
ordinate procedures; all have equal access to external devices 

and files. The top procedure, loaded automatically when the user 
declares his existence, contains a version of the wel 1-known 

DDT debugging system. A special character allows transfer to 

the top procedure from any level, so that the user is automatically 
in position to use DDT's interrogation, breakpoint, and other 


debugging features. 


Input-output devices are referenced symbolically and data can be 
transferred on character, word or block bases; an entire video 
image can be acquired by a system call while the calling pro- 
cedure continues without Waiting. User programs need no 
buffers in their own core images. Line-printer requests, for 
example, are buffered until the device is free. User programs 
communicate by "software interrupts" that are treated the same 
as hardware interrupts; superior procedures can store and re- 
trieve words in inferior procedures as though they were 1/0 
devices; buffered communication is provided so that pairs of 
procedures can treat each other as input devices. The file 


system resembles the CTSS file system. 


4 
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The schedule algorithm tries first to equalize time between 
users, and second to equalize times between the procedures of 
a single user tree. A special procedure always runs which per- 
forms various jobs and can check constant portions of the 
system against a copy in an attempt to detect some forms of 


hardware or system failure. 


The secondary storage uses IBM 2311 discs and DEC microtape 
drives which are file-structured in the same way. Users can 
establish symbolic links between file directories so that files 
can be shared by many programs and many users; these links can 


be chained. 


The system has several operating stations. These include four 
text-display devices, several Teletypes and external telephone 
lines, a main console with DEC 340 display and several slave 
monitors around the laboratory, and a special console with 
controls for operating the eye-hand system. The system in- 
terfaces with a radio transmitter for inexpensive long-distance 


remote experiments. 


A multiplexed digital-analog system operates either in word 
Or block mode, on call or automatically, as requested. Any 
number of users may simultaneously have analog access on 
different channels in different modes. These are some of the 


real-time facilities: 


ie = + 
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1) Iris, focus, stereo mirror, for high-resolution image 
dissector 
2) Pan, tilt, zoom, focus, iris, for small, low-resolution 
image dissector 
3) Extend, tilt, rotate, grasp, finger curl, for hand MA-3 
4) Extend, tilt, rotate, grasp, for hand MA-2 
5) Roli, yaw, horizontal, vertical, swing, for arm MA-2 
6) Position for tactile-sensor device 
7) X, Y, Z, rotate for arm MA-3. 
For moving the arms, special system calls provide acceleration- 
and velocity-limiting, software limit stops and other performance 
limits. To prevent disaster, certain motion commands are un- 
interruptable for limited times, and an arm-moving call is 
illegal if the clevice is under control] of another user. Motion 
calls are not automatically buffered like those to the line 
printer! Although one can read any input channel at any time, 
most mechanical devices can be operated at leisurely speeds, 
with the input multiplexer channels read automatically. The 


system normally does this at a minimum of 50 times per second. 


On the input side are sensors for all mechanical degrees of 
freedom, including hands and arms and optical parameters. 

There is a test stand for calibrating the positional control 

of the arms, and there is a variety of portable remote-control 
boxes with switches and continuous-adjustment knobs. A special 


system call gives the user great flexibility in real-time control 
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Of program parameters. It connects potentiometers (through the 
input multiplexer) to arbitrary program variables -- i.e., memory 
registers -- and these can be specified to be fixed or floating, 

or arbitrary bytes. There are options for assigning absolute 

or incremental meanings to knob positions; in the incremental mode, 
there is a programmed velocity-dependent gain and a side-to-side 
hysteresis so that it is easy to make fine and coarse adjustments 
with the same control, and it tends to remain centered in its 


motion range. 


The system contains good facilities for graphic display, which 
are presently limited to one user. Growing needs dictate 


acquisition of multiple-display hardware. 
LISP (MACLISP) 


LISP is the high-level language the Artificial Intelligence Group 
uses. Our LISP version is different in many ways from other 
LISPs, and some of these differences represent progress. The 
detaiis are of interest only to specialists, who may consult 

the memo by Jon L. White ("Time-Sharing LISP for the PDP-6", 

A.I. Memo 157). The system does not use an A-list for ordinary 
variable bindings; it uses a value property; a special stack 

is used to unbind after lambda conversions. There are many smal] 
improvements in human engineering such as default values of 


non-specified function arguments. The garbage collector manages 
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space for arrays. The READ program allows new character-handling 
facilities and macro definitions of certain kinds. There is a 
variety of powerful editing facilities and display packages, and 
some strong debugging systems. The machine-1anguage feature, 
LAP, has been Strengthened. Whitfield Diffie has restructured 
the compiler to make the most of its decisions by dispatching on 
tables so the users can introduce special forms by giving the 
compiler instructions through the table entries. It would 

be more efficient if the compiler were able to make a deeper and 
more systematic analysis of the uses of expressions being com- 
piled, separating executions for value from those for effect 

on flow of control, for side effects, etc., and for recognition 
of unnecessary recursion and similar simplifications. This 


itself is a problem in artificial intelligence. 


Motivated by MATHLAB and other projects, we are now making an 
effort to introduce facilities for very fast arithmetic into 
LISP. We expect that the resultine system will be almost as 
efficient at number-crunching computation as is any conventional 


algorithmic language. 
Mechanical Hands and Arms 


The project has developed several] mechanical effector devices 
for computer-controlled manipulation. This field is still 
substantially unexplored, and our work can be considered only 


to clarify some of the problems, not to solve many. There are 
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problems in several different areas. 


There is a need for much deeper analysis of what is needed in 
designing a hand-arm system for various kinds of jobs. In his 
thesis, David L. Waltz studies motions the human hand uses 

to perform several basic tool-handling and similar tasks, to 
see which of the degrees of freedam of the hand were most 
essential, and how they were sequenced. There is some in- 
formation about this in the orthopedic literature, but it 

is not sufficiently structured to serve as a base for programs. 
One needs to describe actions in terms of interactions of 
position, force, velocity and sensory responses -- in short, 
one needs something like a programming language for it. We 
also examined some choreographic languages but, although they 
custain some good ideas about path-of-motion description, they 
do not have sensory-interaction predicates. Waltz developed 
some notation that seems helpful and Ernst's discussion is 
still relevant. Waltz's thesis shows how a few degrees of 
freedom can accomplish muci, but one wishes there were much 


more Known about this subject. 


In most of our experimental work, we have employed a modified 
AMF Versatran industrial manipulator arm (which operates in 
cylindrical-polar coordinates). Although the hands we attached 
to this device have some extra motions, the large-scale motion 
of that type of arm has no such redundancy; thus there is 
basically only one way to reach each point in space. Calculat- 


ing this is straightforward, solving a not-too-complicated 
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equation that has only one solution. 


Because this is a very clumsy device -- such an arm is unable 

to reach around obstacles or even to support an unobstructed 
object from an arbitrary direction -- we developed a much more 
mobile arm with many extra degrees of freedom. The new arm be- 
haves more like a tentacle than like a coordinate system. Hy- 
draulically operated, it is relatively slim and has five ‘articu- 
vations, each with two degrees of freedom, a shoulder, three 
elbows and a wrist. An articulated prototype shows clearly that 
this geometry is adequate for everything a human arm can do and 
more. In the actual hardware mechanism, each of the eight in- 
terior hinge joints has approximately 110 degrees of flexion. 

We conclude that this is inadequate; it must be more nearly 180. 
The lesson we learned is that in a minimal system a small angular 
restriction means only that the work space is somewhat reduced. 
But, in a linkage whose purpose is a great variety of ways to 
reach points in the work space, all restrictions interact in 
messy ways to break up the higher-dimensional mobility space 
into hard-to-understand fragments, so that two Slightly dif- 
ferent three-dimensional positions may have to be reached (if 

at all) by grossly different global arm configurations. For 
such a mechanism with 10 degrees of freedom for reaching points 
in only three dimensions, there is a mathematical problem of 


inverting the position equations. One can sometimes get by 
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with simple hill-climbing by successive approximations, but 

this is really unsatisfactory because it is hard to adjoin global 
information, and it does not give an analytic picture of the al- 
ternative solutions to the problem. Jacques-Yves Gresser, in 
his thesis discusses a method that gives a solution specific to 
our arm-mechanism, using a rather general method of dividing the 
Position problem into a series of almost-independent factor 
sub-problems. What Gresser does is to divide the arm in half, 
and describe the mobility space of eech half, using simplifying 
approximations for each side. When the half-solutions are put 
together, there’ needs to be an accuracy-increasing iteration 


anyway, so there is hardly any real loss through approximating. 


Now, in our arm-mechanism, it happens that the mobility zones of 
the half-arms can be described in terms understandable to humans 
-- for this particular arm, each zone resembles a portion of a 
torus. To find the ways to get the whole arm to a certain space 
position, one describes the intersection of the two tori, one 
centered at the shoulder position and the other the goal 
position. Each intersection point yields an approximate 
solution to the problem (that is, a possible location of the 
center of the arm), and a higher-level program could be asked 


which solution is most desirable. 


There is a larger problem here: how should one represent a 
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machine's body image? For the problem of a single, not-too- 
complicated arm, one can doubtless get by with cleverely coded, 
sparse, three-dimensional arrays, but one would like something 
more symbolic. And one wonders what happens in the nervous 
system; we have not seen anything that might be considered to be 
a serious theory. Consider that a normal human can place an 
object on a table, turn about and make a gross change in his 
position and posture, and then reach out and grasp within one 

or two inches of the object, all with his eyes closed! It 

seems unlikely that his cerebellum could perform the appropriate 
vector calculations to do this; it is not a mere matrix inver- 
sion (and, even if it were, one would still need a theory). 

We would presume that this complex motor activity is made up, 
somehow, of a large library of stereotypical programs, with 

some heuristic interpolation scheme that fits the required 
action to some collection of reasonably similar stored actions. 
But we have found nowhere any serious proposal about neurological 
mechanisms for this, and one can only hope that some plausible 
ideas will come out of robotics research itcelf. Unfortunately, 


at present this area is somewhat dormant. 


Another conclusion from our experiments is that delicate mani pu~ 
lations must be controlled by application of controlled forces 
(rather than by direct position control), with attention to 
matching velocities with inertias. For measuring the forces 


on a mechanical hand, we have taken two approaches. First, it is 
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quite feasible to engineer a arasping surface with good pressure 
sensitivity at a great many points by wrapping a coaxial cable 
connected to a Time-Domain Refiectometer. (A TDR is a sort of 
radar system designed to measure reflected radio waves that are 
Produced in a soft-sheathed tube, by any deformations of the wall. 
The instrument permits hundreds of thousands of points, using a 
single electrical connection to the hand. (Waltz describes this 
system in his thesis.) Second, we have developed a strain-guage 
telemetered wrist that arovidés force information in the necessary 
six degrees of freedom. This little instrument is able to tell 
where and in what direction the hand is being pushed, assuming 


that the force is being applied at a single point -- a condition 


hat _usually holds in a first contact with an object. 
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Solution of the appropriate moment equations for the set of six 
forces at the wrist (Fig. 30) yields a certain line in three- 

dimensional space, along which a force of a certain magnitude is 
applied. If the machine knows the geometry of its hand, it can 
Presume that the force is applied where this line intersects the 


hand. 


Fig.30 
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We have built this hand and are testing it. We shall describe it 


in a memo by Callahan, Shah and Minsky. 
Computer Eyes 


A computer must have eyes if it is to see. When the project 
Started, no device was avaiable for interfacing a computer with 
a real-time camera, although there were film-reading devices on 
the market. We first used a Vidicon television camera, but we 
had difficulties with the limited signal-to-noise quality of 

the Vidicon. Besides, Vidicons are not suitably adaptable to 
random-access scanning, they have motion persistence and other 
Problems. We decided that this added up to too many obstacles. 
The image-dissector camera offered much cleaner images (at the 
expense of sensitivity, which, for laboratory purposes, was 
unimportant) so, in collaboration with Information International, 
Inc., we designed and constructed our present camera system. 

An image dissector is essentially an inverted cathode-ray tube: 
an image projected on a photo-cathode produces electrons which 
are focused through a deflection system so that the current from 
a selected image point falls through a small hole and is then 


measured. 


In such a system, the Signal-to-noise ratio depends essentially 
on the number of electrons, and one can obtain more precision at 
the price of longer time-exposures. We decided to put this 


signal-to-noise ratio under direct program control and to make 
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it independent of the brightness of the point being measured. 


This is built into the camera's video processor. Horn gives the 


details ("The Image Dissector 'Eyes'", A.I. Memo 178). If the 


constant signal-to-noise constraint were enforced even on very 
dim points, the exposure times would become excessive, so the 
system has also a dark cut-off parameter that terminates the 
measurement very early if an initial estimate of the brightness 
falls below a specified value. This is also built in the video 
hardware. The resulting system can measure intensity over a 
4096:1 dynamic range, with a brightness-discrimination sensitivity 
of one part in 64 throughout this range. The output appears 

as a floating point number or as a logarithm of intensity. The 
eye has also a reference-brightness input that can be used to 
make the measurements almost independent of over-all illumination 
fluctuations, and there are various protective devices to prevent 
damage to the image tube by excessively bright light sources. 
Horn's report discusses many advantages and pitfalls in using 
this system; anyone contemplating using image dissectors for 
computer vision should correspond with us about the results of 


some modifications now under construction. 


It is frequently suggested that one ought to build various forms 
of parallel-processing artificial retinas, perhaps along the 
lines of biological systems. But the operations of the vertebrate 


retina and the subsequent image processing is not nearly so well 
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understood as is generally believed, and we do rot think 

the time is quite ripe for taking such a step in hardware. 

In our present operations, the computation time consumed in 

the higher levels of scene-analysis is comparable to that spent 
in the lower-level pre-processing, so at present there would 

be no large time factor to be gained from fast parallel hard- 


ware -- even if we were able to decide how it should work. 
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